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(57) Abstract 

The invention provides for meth- 
ods for identification of biologically active 
biomolecules. In one aspect, a biologically 
active biomolecule such as RNA or a pep- 
tide is identified by incorporating random 
nucleotide sequences in a scaffold consti- 
tuted by an enzyme activity modulator, 
transforming substantially identical host 
cells with the construct obtained thereby 
and screening the transformed cells 
identify those where a preselected pheno 
typic trait has been altered. Hie random- 
ized DNA is subsequently isolated from 
the phenotypically altered cells and the 
peptide and/or RNA encoded by the ran- 
dom sequence is determined. In turn, in- 
teraction partners which are putative drug 
targets are identified and isolated by use of 
the peptide and/or RNA as part of affin- 
ity reagents. A preferred scaffold is de- 
rived from the potato inhibitor I family of 
protease inhibitors and exemplified is the 
barley chymotrypsin inhibitor 2A (CI-2A). 
Another aspect relates to the identification 
of novel enzyme inhibitors by using sub- 
stantially the same approach, but screening 
specifically for changes in target enzyme 
activity. Also disclosed are methods of 
producing the relevant transformation and 
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Novel methods for the identification of ligand and target biomolecules 
FIELD OF THE INVENTION 

The present invention pertains to a novel method for the 
Ldon t i f i ca t ion / prepa ra t ion of pept ides or r ibonu oleic acids 
5 ca pa b 1 ! j o f in^'dii 1 u ^ i n g t h e a c 1 1 v 1 1 y _i n v i vo o f t a rQe t c n z ynie s 
in eukaryotic; cells. More specifically, the invention provides 
a method for identification/preparation of hitherto unknown 
enhancers as well as inhibitors of in \ r ivo enzyme activity in 
eukaryotic cells. Furthermore, the invention relates to me- 

10 thods for identification of unknown interactions (i.e. identi- 
fication of a target and/or a ligand but also of hitherto 
unknown interactions between known ligands and known tarqets) . 
These novel methods employ enzyme inhibitor structures as 
scaffolds in order to i nt race! lular ly display potentially 

15 biologically active peptides or ribonucleic acids in a stable 
form. Also disclosed herein are methods for the preparation of 
the hitherto unknown Ligands or targets as well as methods for 
the preparation of vectors and transformed cells carrying the 
genetic information encoding these ligands and targets. Fi- 

20 nally, the invention relates to a method for the preparation 
of a medicinal product which is based on initial identifica- 
tion of targets or ligands according to the present invention. 

BACKGROUND OF THE INVENTION 

The CellScreen™ technique is a method which allows for the 
25 identification of peptide sequences having biological activity 
in vivo and which is disclosed in WO 96/38553. In short, 
libraries of random peptides are expressed intracellularly in 
eukaryotic (eg. mammalian) cells, such that one cell expresses 
one single or a few heterologous short peptides. Cells that 
30 change a preselected phenotype under certain conditions can be 
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isolated and the peptide that they express can hence be iden- 
tified. The intracellular component with which the peptide 
interacts (the target molecule) may subsequently be obtained 
using e.g. affinity columns carrying the immobilized synthetic 
5 peptide. 

Although the CellScreen™ technology has shown great promise 
for identifying new drug targets, it is an inherent problem 
that the intracellular environment is relatively hostile to 
many heterologous expression products. In other words, inter- 
10 esting peptide or nucleic acid sequences which potentially are 
capable of interacting with an important target molecule may 
be degraded or inactivated inside the cell before any effect 
on phenotype can be detected. 

OBJECT OF THE INVENTION 

15 It is an object of the invention to provide improvements in 
the CellScreen™ technology by overcoming the above-mentioned 
problems of potential instability of expressed sequences. 
Furthermore, it is an object of the invention to expand the 
utility of the CellScreen™ technology to also encompass 

20 screening in prokaryotic cells. 

SUMMARY OF THE INVENTION 

A significant number of enzyme activity modulators of plant, 
microbial and eukaryotic cell origin have been described, cf. 
below . 

25 Since many of the naturally occurring processes inside cells 
are regulated by enzymes, the inventors disclose herein a 
method for expression of large intracellular libraries of such 
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enzyme activity modulators in which the active site of said 
enzyme activity modulators have been altered ley .introduction 
ot stretches of randomized amine* acid segue nee::: or by intro- 
duction of random nucleotides at specific sites in the active 
5 :u:;e. This creates libraries of putative modulators capable of 
modulating the activity of an array of different enzymes 
inside cells. By expressing these modulators in cell lines 
according to a novel variation ot tne Co i L Screen'" technology 
the enzymatic regulatory mechanisms inside the individual cell 
10 in said cell line will be affected differently leading to 
d i f f'e rent phenotypic properties such as e.g. res i stance to- 
wards hypoglycemia, cytokine killing, toxic compounds, virus 
infection etc. 

The main advantage of using known enzyme activity modulators 

15 such as enzyme inhibitors as scaffolds is that many of these 
in their native form are stable in the intracellular environ- 
ment. The problem of using e.g. antibody fragments as scaf- 
folds for intracellular presentation of random peptides is 
that many such antibody fragments are susceptible to the 

20 proteolytic and reducing intracellular environment and there- 
fore are unsuitable as intracellular scaffolds. On the other 
hand, enzyme activity modulators can , if carefully tailored, 
maintain their intracellular stability and at the same time 
incorporate random sequences which are screened for biological 

25 activity. Furthermore, the effectivity of such a screen for 

biologically active substances will be higher than if using an 
unstable scaffold (such as a e.g. a coiled coil structure) or 
no scaffold at all, since none or only a very limited number 
of the randomized sequences will be degraded before they can 

30 exert their effects in the cells. Finally, a majority of such 
enzyme activity modulators have an active site which is the 
perfect position in the molecule to modi f y , since the active 
site is normally presented in a stable configuration to the 
environment . 
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Hence, one aspect of the invention pertains to a method 
for identifying an in vivo active modulator of activity of a 
target enzyme, the method comprising the steps of (a) pre- 
paring a pool, of expression vectors, each vector of said pool 
5 containing at least one member from a library of randomly 

modified nucleotide sequences derived from a parent nucleotide 
sequence encoding a parent peptide or parent ribonucleic acid 
which modulates the target enzyme activity, (b) transforming a 
population of substantially identical cells with said vectors 

10 of said pool so as to obtain transformed cells, said substan- 
tially identical cells being ones which harbour the target 
enzyme, (c) culturing said transformed ceils under conditions 
facilitating expression of said randomly modified nucleotide 
sequences, (d) examining said transformed cells and isolating 

15 transformed cell ( s ) wherein the activity of the target enzyme 
is modulated, and (e) identifying the modulator by determining 
said randomly modified nucleotide sequence of said vector 
present in cell (s) isolated in step (d) and/or determining the 
amino acid sequence or the ribonucleic acid sequence of the 

20 expression product encoded by said randomly modified nucleo- 
tide sequence . 

In this aspect of the invention it is normally preferred that 
the randomly modified nucleotide sequences consist of 1) an 
invariable part of the parent nucleotide sequence, and 2) 
25 random nucleotides. In line with the above, the invariable 
part of the parent nucleotide sequence preferably encodes a 
scaffold portion of the parent peptide or of the parent ribo- 
nucleic acid which serves to stabilize said polypeptide frag- 
ment or ribonucleic acid fragment . 

30 As mentioned above, the use of enzyme modulator scaffolds also 
provides for expression of stable, biologically active modula- 
tors which interact with other biomolecules than enzymes. In 
other words, in cases where random nucleotides are inserted in 
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a scaffold structure, the outcome will be a stable expression 
product, the activity of which does not necessarily have any- 
thing to do with enzyme activity modulation. 

Hence, another part of the invention is a method for idontify- 
5 inq a modu.l rtor in the form of a biologically active polypep- 
tide fragment or ribonucleic acid fragment which is capable of 
aeioc-aLiiy moauiacinq, in vivo, a pnenotypic trait m a cell, 
the method comprising the atopa of 

(a) preparing a pool of expression vectors, each vector of 
10 said pool containing at least one member from a library of 
randomly modified nucleotide sequences derived from a parent 
nucleotide sequence encoding a parent peptide or parent ribo- 
nucleic acid which in vivo modulates activity of a known 
enzyme, wherein the randomly modified nucleotide sequences 
15 comprise 



(b) transforming a population of substantially identical cells 
with said vectors of said pool so as to obtain transformed 
cells, (c) culturing said transformed cells under conditions 
facilitating expression of said randomly modified nucleotide 

25 sequences, (d) examining said transformed cells and isolating 
transformed cell (s) wherein the preselected phenotypic trait 
is modulated by the presence of the expressed randomly modi- 
fied nucleotide sequence, and (e) identifying the modulator by 
determining said randomly modified nucleotide sequence of said 

30 vector present in cell (s) isolated in step (d) and/or deter- 
mining the amino acid sequence or the ribonucleic acid se- 
quence of the expression product encoded by said randomly 
modified nucleotide sequence. 



an invariable part encoding a scaffold portion of the 
parent peptide or of the parent ribonucleic acid, said 
scaffold portion serving to stabilize said polypeptide 
fragment or ribonucleic acid fragment, and 



20 - 



random nucleotides, 
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Finally, the invention also pertains to the general use of 
int racel lularly stable scaffold proteins, ribonucleotides, or 
fragments thereof for the presentation of random sequences in 
the CellScreen™ technology. As mentioned above, although the 
5 concept of using scaffold molecules has been discussed in the 
prior art, the issue of the stability of the scaffold system 
has not been detailed. 



The stability and usefulness of a putative intracellular scaf- 
fold is dependent on a number of factors. First of all, it is 

10 essential that the relevant cell wherein the scaffold is to be 
expressed is capable of expressing the scaffold molecule in a 
functional form; that is, in prokaryotic systems some eukaryo- 
tic proteins will not fold correctly, hence rendering the use 
of such a protein unsuitable as a scaffold in that type of 

15 cell. Second, the scaffold should be relatively resistant to 
the reducing and catalytic environment inside intact cells. 
However, even when a scaffold molecule is relatively suscep- 
tible to the inactivating nature of the intracellular environ- 
ment, this can be remedied if the production rate of the 

20 scaffold molecule is sufficiently high. 

In steady state, the intracellular concentration of a scaffold 
molecule will be a function of the following formula: 

c 

^scaffold ~ Rj 

-where R p is the rate of production of the scaffold molecule 
25 (moles * s' 1 ) and ^ d is the inactivation constant for the scaf- 
fold molecule (1 * s _1 ) , i.e. the rate of inactivation of the 

scaffold molecule is determined by ^f- = scaffold (which, i n 

steady state, of course equals R p . In other words, when assess- 
ing the suitability of a potential scaffold molecule, it 
30 should according to the present invention be tested whether 

the molecule can be kept at a sufficiently high concentration 
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inside the relevant cell wherein the CellSoreen™ test is going 
to b e o arri e d out. 



Therefore, a very broad aspect of: the invention pertains to a 
method for identifying a modulator in the form of a biolo- 
l > qica 1 1 y act ivc polypeptide f ragment or r ibonucleic acid f raq- 
ment which is capable of cie foe tab 1 y modulating, in vivo, a 
phenotypic trait of a cell, the method comprising the steps of 

(a) preparing a pool of expression vectors, each vector of 
sard pool containing at least one member from a library of 

10 random Ly modified nucleotide sequences derived from a parent 
nucleotide sequence encoding a parent peptide or parent ribo- 
nucleic acid which is stable int. race 1 iularly, wherein the 
r a n d om 1 y modified nucleotide s e q u e n c e s c omp rise 

an invariable part encoding a scaffold portion of the 
15 parent peptide or of the parent ribonucleic acid, said 

scaffold portion serving to stabilize said polypeptide 
fragment or ribonucleic acid fragment, and 
random nuc loot ides , 

(b) transforming a population of substantially identical cells 
20 with said vectors of said pool so as to obtain transformed 

cells, (c) culturing said transformed cells under conditions 
facilitating expression of said randomly modified nucleotide 
sequences, (d) examining said transformed cells and isolating 
transformed cell(s) wherein the preselected phenotypic trait 

25 is modulated by the presence of the expressed randomly modi- 
fied nucleotide sequence, and (e) identifying the modulator by 
determining said randomly modified nucleotide sequence of said 
vector present in cell(s) isolated in step (d) and/or deter- 
mining the amino acid sequence or the ribonucleic acid se- 

30 quence of the expression product encoded by said randomly 

modified nucleotide sequence. In this aspect of the invention, 
the expression product of the nucleic acid sequence which 
encodes the intracellularly stable parent peptide or parent 
ribonucleic acid is one which, when produced by the substan- 
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tially identical cells, is present in an effective concentra- 
tion and in a functional state. 

In other words, it is essential that the suitability of the 
scaffold molecule is evaluated prior to performing the steps 
5 of the CellScreen™ technology in order to confirm that the 
scaffold molecule in unmodified form can be expressed and 
maintained at a sufficiently high concentration/activity in 
the cellular system where the method of the invention is to be 
exercised. 

10 According to WO 96/38553, the isolation of a drug target mole- 
cule can be made more efficient if the random peptide sequen- 
ces are inserted into larger polypeptides functioning as 
scaffolds for display of the random amino acid sequences . Such 
scaffolds would probably also lead to higher affinity interac- 

15 tion with the target molecule. 

Nothing is, however, mentioned about the use of scaffolds de- 
rived from naturally occurring protein inhibitors of enzymes. 
Inhibition of enzymatic activity by such inhibitors - as 
opposed to the simple binding of a target protein inside a 
20 cell as was suggested in the CellScreen™ technology - is a 
much more efficient way to affect intracellular biochemical 
events . 

DETAILED DISCLOSURE OF THE INVENTION 
Definitions 



25 In the following, a number of terms will be defined for the 
purposes of the present disclosure: 
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A "modulator" is in the present context a biomolocule which, 
when expressed in vivo, effects the activity of another biomo- 
leeule in the cell. Thu:. , the modulator in essence can inhibit 
or enhance the activity of the biomoLecule . furthermore, the 
5 mo- i\\ ) it < r < an interact directly with the biomolocule, but the 
-offers might as well be indirect, i . o . the activity change of 
the b i omo 1 ecu 1 is brounht about, by ehrinqef:' in the cell's 
riiocliomica I machinery, changes which are ultimately the result 
of the presence of the expression product of the randomized 
10 n u c 1 e i c a c i d s e q u e nee. 



A "randomly modified nucleotide sequence" is a nucleotide se- 
quence which in a number of positrons has been subjected to 
insertion or substitution by nucleotides, the nature of which 
cannot, be predicted. In many cases the random nucleotides or 

15 nucleotide sequences inserted will be "completely random" 

(e.g. as a consequence of randomized synthesis or PCR-mediated 
mutagenesis) . However, as will appear from the disclosure 
below, the random sequences can also include sequences which 
have a common functional feature (e.g. reactivity with a 

20 ligand of the expression product) or the random sequences can 
be random in the sense that the ultimate expression product is 
of completely random sequence with e.g. an even distribution 
of the different amino acids. 



"Substantially identical cells" is a term herein intended to 
25 designate cells which all exhibit a specific phenotypic trait 
in such a manner that a change in the expression of said trait 
in one cell due to an interaction effected by the introduction 
of random nucleotides according to the inventive methods would 
also occur in one of the other substantially identical cells 
30 had these been transformed with the same vector, in other 
words, the important parameter to assess when choosing sub- 
stantially identical cells in the inventive methods is whether 
an observed change in one cell's exhibition of the phenotypic 
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trait can be taken as an indication that any other cell in the 
population would have bohavod the same way as a consequence of 
the same change. Hence, substantially identical cells can for 
instance be clonal cells or cells of a cell line or they can 
be cells of a cell culture or a tissue culture. 



A "phenotypic trait" is the observable result of a certain 
gene composition in a cell (genotype), i.e. a property of a 
cell (detected by chemical, physical, immunological or any 
other suitable means) which depends on the presence of one or 
10 several genes and the expression rate thereof. Thus, the 

phenotypic trait can be any of a number of different proper- 
ties: activity of an enzyme, effects of interaction between 
receptors and ligands, cell survival rate, presence or absence 
of an antigen, expression rate, etc. 

15 "Peptide" is in the present context intended to mean both 

short peptides of from 2 to 10 amino acid residues, oligopep- 
tides of from 11 to 100 amino acid residues, and polypeptides 
of more than 100 amino acid residues . Furthermore , the term i s 
also intended to include proteins, i.e. functional biomole- 

2 0 cules comprising at least one polypeptide ; when comprising at 
least two polypeptides, these may form complexes, be cova- 
lently linked, or may be non-covalent ly linked. The polypep- 
tide (s) in a protein can be glycosylated and/or lipidated 
and/or comprise prosthetic groups. 

25 When using the term ^biologically active" to designate a 

molecule is herein meant that the molecule in question exhi- 
bits a detectable effect on living cells, i.e. that the mole- 
cule interacts with the biology of the living cell so as to 
produce an effect which can be recognized as a change in the 

30 cell's phenotype. 
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"In vivo" is herein mean to designate the environmental condi- 
tions inside living cells (i.e. cells which are met abo 1 ical 1 y 
active and can maintain their vital functions); the Living 
ceils may bo kept in culture or may be present in a natural 
5 habitat ( e . g . tunctioninq as part: of a larger, multicellular 
organism). Thus, the term "in vivo" also refers to in vitro 
cul taring of cells as long as the effect; being observed is 
t a k i n g p L a c o in t h e living c e 1 1 . 

"Transformation": A process by which the genetic material car- 
lo ried by an individual cell is altered by incorporation of exo- 
genous ON A into it 3 genome . 

"Transf ect ion" : The uptake, incorporation, and expression of 
recombinant DNA by cells. 

"Transduction": The transfer of genetic information from one 
15 cell to another by way of a viral vector. 

The term "effective part" when used in the context of a pro- 
tein, peptide, or ribonucleic acid is in the present context 
intended to mean a part (e.g. a subsequence or, in the case of 
an numeric protein, a less- than-n-meric molecule) which has 

20 retained the desired functionality of the native molecule from 
which the protein or peptide is derived. For instance, in the 
case of CI-2A only the truncated form of the molecule seems to 
be necessary to ensure expression of the active inhibitor 
intracellularly, and hence the truncated form of that molecule 

25 constitutes an effective part of CI-2A; cf. also the Examples 
herein . 

Unless otherwise indicated, nucleotide sequences are presented 
herein in the 5' -3' direction and amino acid sequences are 
presented so as to set out with the N-terminus at the left. 
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the, invent ive method 
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In general, the disclosures in WO 96/38553 and WO 97/27212 
relating 

to preparation of randomize:! sequences, to the choice of 
fusion partners (except for the choice of scaffolds) for 
the random sequences, to the choice and composition of 
targeting sequences, to the choice of nucleic acids to be 
randomly modified, to the choice of randomization method, 
to the methods of introducing the nucleic acids into the 
relevant cell type, to the choice of (retroviral) vectors 
(where applicable), to the method of producing the vec- 
tors, the choice of promoters, to the choice of packaging 
cells (where applicable), to the methods of concentrating 
infectious virions from the packaging cells (where appli- 
cable), to the choice of substantially identical cells to 
use in the method, to the type of phenotypic changes 
detected, to the manner in which, the change is detected, 
to methods of isolating the phenotypically changed cells, 
to the isolation and sequence determination of the ran- 
domly modified sequences, to the isolation and character- 
ization of the target for the randomized product, to 
screening methods, and to the choice of applications of 
the methods 

also are relevant for the purposes of the present invention. 
Therefore, the disclosures of these two patent applications 
are hereby incorporated by reference herein. However, since 
both of these references are focussed on the use of the gen- 
eral principle in higher eukaryotic cells, the present disclo- 
sure will also detail on embodiments pertaining to the use in 
prokaryotic systems, cf . below. However, the more general part 
of the two above-referenced disclosures which without any 
difficulty for the skilled person could be applied in the 
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context of a prokaryotic system are also regarded as relevant 
and important embodiments of the present invention insofar as 
it relates to the use of the methods in prokaryotic systems. 

It is normally preferred that the trans termed cells which are 
r > being examined in step (d) predominantly carries (and expres- 
ses) one sinqle copy of the vector. By ensuring this, the 
iiiL'ji piuLdLiuii ui u eudiKju in pnenocype ol me cells becomes a 
much easier task, whereas the interpretation of a phenotype 
change in cells expressing more than one single randomized 
10 sequence renders unclear which of the transforming vectors is 
r e s p on s i b 1 e f o r t. h e c h a n g e . 

To ensure that predominantly one vector nas transformed each 
of the cells examined it is e.g. feasible that, the transforma- 
tion step (b) is performed under such conditions that the 

15 cells transformed are predominantly or at most transformed 
with one single vector from said pool (this can e.g. be 
achieved by adjusting the concentration of infectious virions 
in embodiments of the invention where the transformation is 
obtained by means of transduction) , or wherein, prior to 

20 carrying out step (d) , cells being transformed with more than 
one vector from said pool are substantially excluded from the 
further steps. This latter option reguires that it is possible 
to quantify the number of transforming vectors and this can be 
achieved by including a detectable marker in the expression 

25 product, e.g. a flourescent probe. Another option is to re- 
screen cells which exhibit changes in phenotype, thereby 
ascertaining whether more than one vector has transformed the 
cell. 

It will be understood that the molecule chosen for the purpose 
30 of being a scaffold, and wherein the random sequences are 

ultimately introduced, can be either a peptide sequence or a 
nucleic acid sequence, such as an RNA fragment interfering (in 
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an ant isense-manner ) with mRNA, t RNA or with a ribozyme. 
Alternatively, such an RNA fragment could exhibit ribozyme 
activity itself, thereby having an indirect influence on the 
expression rate of other enzymes. At any rate, the resulting 
5 product, i.e. the randomized expression product, can be a 
peptide or a ribonucleic acid such as a ribozyme. 

One important feature of the scaffold is, as mentioned above, 
that it is stable towards proteolytic attack and/or is 
insensitive to a reducing environment, such as the one which,- 
10 is found intracellularly . 

In preferred embodiments of the invention the random nucleo- 
tides are introduced in part(s) of the parent nucleotide 
sequence which encode (s) the active site(s) of the parent 
peptide or parent ribonucleic acid, or the part(s) which 

15 encode (s) structure (s) interfering with the active site (s) . As 
discussed above, the active site (as well as other exposed 
structures of the scaffold) need to be stably presented to the 
environment in order to be able to interact with other 
biomolecules. Hence, preferably the invariable part of the 

20 nucleotide sequence encodes truncated parts of the parent 
peptide or parent ribonucleic acid sufficient to maintain 
stability of the randomized product. 

In some embodiments it is preferred that the invariable part 
of the parent nucleotide sequence encodes a peptide which is 

25 free from disulfide bridges. This is due to the fact that 
disulfide bridges are not formed in the nucleus or in the 
cytosol. Hence, in cases were the scaffold must be in a func- 
tional state when present in the nucleus or the cytosol, it 
would normally be preferable to use a scaffold which does not 

30 contain disulfide bridges or which do not rely on these in 
order to maintain stability and functionality. On the other 
hand, in embodiments where it is desired that the randomized 
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cxpr e s s ion product i s c o n t i nod to the E R , o r t o a noth e r c o m - 
partment allowing for the presence of stable disulfide brid- 
ges, before the randomized sequence is pre- sen ted to the envi- 
ron i n e n L in a satis f actor y ma n ner, .1 1 . i s o f c - ou r s c : <: i e s i r a b 1 e 
5 that the invar iable part of the parent nucleotide sequence 

encodes < i peptide having disulfide bridges, because the chan- 
ces of having a correctly folded and functional scaffold 

vn. l.. LUt.: :.>u^ii U^JIll^'rl i LHlOilLS IS L'e IcU. 1 VCLy Sill a L .1 - 

It will be understood that the random nucleotides are prefer- 
10 ably introduced in the form of an insertion or a substitution 
into the parent nuc bo< a i d> > sequence, opt ionally in combination 
with deletion (s) in trie parent nucleotide sequence. Deleted 
sequences in the parent polypeptide could e.g. be parts of an 
active site, the presence of which in unaltered form is toxic 
15 or otherwise d e 1 e t e r i o us t o the transfo r me d c e lis. 

The number of random nucleotides introduced can vary to a 
great degree but normally the number is between 3 and about 
100. In this range it is preferred that at least 5, such as at 
least 7, and better, at least 9-12 random nucleotides are 

20 introduces. On the other hand, it is preferred that at most 
90, such as at most 70 or 80 random nucleotides are intro- 
duced. The most preferred number of introduced randomised 
nucleotides varies between 15-60, preferably 20-55 or 25-50 
nucleotides. Especially preferred numbers of random nucleo- 

25 tides are 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 
46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, and 60 
nucleotides . 

The random nucleotides are introduced in the scaffold in the 
30 form of nucleotide sequences and/or in the form of single 

random nucleotides introduced at specific sites in the parent 
nucleotide sequence. A variation is to substitute a part of 
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the sc;affolci sequence with a sequence which retains parts of 
the scaffold sequence (e.g. those which are essential for 
stability/functionality) but where other parts are randomized. 



The random nucleotides are preferably selected from the group 
5 consisting of 

synthetic, completely random deoxyribonucleot ides ; 
synthetic random DNA sequences, wherein limitation on 
randomization of some nucleotides is introduced so as to 
limit the number of available sequences and/or to avoid-- 
10 undesired stop codons and/or to facilitate introduction 

of post-trans lat ionai modifications of expressed pepti- 
de (s) ; 

- synthetic random DNA sequences as in (1) or (2) coupled 

to a sequence encoding a purification tag; and 
15 - CDR encoding nucleotide sequences isolated from a library 
of immune- competent cells rai sed against an antigen (in 
this embodiment it is preferred that CDR encoding nucleo- 
tide sequences encode CDR-3 peptide sequences). 

The latter type of "randomization" actually introduces a re- 
20 striction on randomness which ensures that the sequences 
introduced encodes an antigen recognizing region. It is, 
however, well known that a polyclonal immune response against 
an antigen consist of a large number of immune competent cells 
which all react with the same antigen (or perhaps even with 
25 the same epitope) but the correlation between amino acid 
sequences of the CDRs and recognition of the epitope (s) is 
virtually impossible to deduce. 

An alternative way to introduce limitations on the randomness 
of the nucleic acid sequences which are ultimately tested in 
30 the substantially identical cells is the following: Upon 
preparation of the vectors, they are used in a 1 st round of 
phage display, where the phages transformed with the vectors 
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are panned against a library containing a ligand of choice. As 
for the technique of employing CDR encoding sequences, the 
result is that the sequences which are ultimately tested in 
the sunstant ially identical eel Ls are "unpredictable" (and 
5 t he r oby random ) bu t neve r c he 1 e s s so 1 oo t ed or : t he ba s i s o f a 
functional feature. Again, t h« • lack < ) f kne >wn era. r>.- ial ion 
between nucleic acid sequences and t he interact: ion in three- 
a Linens ionai spac- Detween the expression product am:] a ligand 
of choice has the consequense that the tested subgroup of 
10 sequences still is randomized. 

In a special embodiment of the above-technique where the 
method of the invention is combined with phage display, both 
test systems are repeated in an alternating manner, that is a 
shuffling between intracellular expression in the substantial 
15 identical cells and panning of a phage library. 

In order to obtain an controlled distribution of amino acids 
in the randomized peptides, when the modulator is a peptide, 
it is practical that the random nucleotides are prepared by 
random codon synthesis where defined DNA codons are synthe- 

20 sized in a random order; a thorough description of this prin- 
ciple is given in WO 96/38553, cf. Example 1 therein. The 
preferred embodiment in this context is one wherein the rela- 
tive amount of synthesized codons ensure that all encoded 
amino acids will be present with substantially the same fre- 

25 quency in the total of encoded polypeptide fragments, i.e. 
that the chance of encountering one specific amino acid in a 
library peptide is substantially the same as for any other 
encoded amino acid . 

In order to introduce the randomized fragments properly into 
30 the vectors, it is according to the invention preferred that 
the random nucleotides are introduced into the expression 
vector by the principle of site directed PCR-mediated mutagen- 
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esis. However, other options are known to the skilled person, 
and it is e.g. possible to insert synthetic random sequence 
libraries into the vectors as well. 

Apart from having the randomized fragment of the expression 
5 product introduced into a scaffold in accordance with the pre- 
sent invention, it is often necessary to couple the random 
sequence to a fusion partner by having the randomized nucleo- 
tide sequence fused to a nucleotide sequence encoding at least 
one fusion partner. Such a fusion partner can e.g. facilitate 
10 expression and/or purification/isolation and/or further 
stabilization of the expression product. 

For the purposes of purification, the fusion partner can 
include a purification tag such as His 6 tag, myc tag, BSP 
biot inylat ion target sequence, of BirA, flu tag, lacZ, and 
15 GST. Furthermore, the fusion partner may include a sorting 
signal or a targeting sequence, cf . the discussions below. 

In embodiments where the modulator is itself a modulator of 
enzyme activity, it is in theory possible to effect both the K M 
and/or the V rTiax of the relevant enzyme. A reduction in K M re- 

20 suits in less effectivity of the relevant enzyme insofar that 
an increased substrate concentration is required to obtain 50% 
of maximum activity of the enzyme. An increase of K M has the 
opposite effect. Of course, interference with an enzyme which 
effects V mnx has as a consequence that the maximum possible rate 

25 of activity of the enzyme is increased (when Vmax is increased) 
or decreased (when VlTiax is increased) . At any rate, the modula- 
tor of the enzyme activity will give the phenotypic impression 
that the enzyme activity has either been inhibited or stimu- 
lated. It is preferred in this embodiment that the modulator 

30 is an inhibitor. 
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When the method of the invention has finally lead to the 
identification of a modulator or of a target therefor, it is 
preferred that the 3-dimensional structure of the identified 
modulator is resolved, since this allows for the implementa- 
tion of rational drug screening and computer drug modelling 
methods . 



— • — — ^ — ' '•• — l -ii"-" l - Lve iiu m.iiou;s in proKary o tic systems 

The originally envisaged technology disclosed in WO 96/38553- 
focussed on screening for interactions in eukaryotic cells. 
10 However, the technology is also applicable in prokaryotic sys- 
tems . 

For instance, it is expected that the present invention will 
allow for identification of hitherto unknown interactions in 
pathogenic bacteria, interactions which will be useful in the 

15 course of developing new antibiotics. Since the inventive me- 
thods allows for the identification of both novel ligand pep- 
tides and ribonucleic acids as well as of the target molecules 
for these ligands, the investigator is provided with the 
necessary tools for instigating computer drug modelling and 

20 for performing traditional drug screening, once such ligands 
and/or targets have been identified/isolated. 

However, apart from the approach of identifying antibacterial 
effects and substances, the method also opens up for improve- 
ments in industrial fermentation processes. In such cases it 
25 will e.g. be possible to identify biomolecules which are 

important m the biochemical pathways in lactic acid bacteria 
and thereby provide tools for the production of new dairy 
products such as cheese, yoghourt, and other products of 
lactic acid bacterial fermentation. 
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Somewhat related to this approach is the use of the methods of 
the present invention in screening performed on bacterial cul- 
tures used in purification processes. It is well-known in the 
art of e.g. waste water purification that the microbiological 
5 cultures (activated sludge) which conduct the degradation of 
organic material, are relatively vulnerable vis a vis changes 
in the environment and therefore the provision of more robust 
strains of bacteria would be one way to improve such systems. 
Alternatively, the method of the present invention would also 
10 allow for the ident i f ica t ion/ isola t ion of ligands and targets 
in such bacteria which, when interacted with, can lead to e.g. 
increased efficacy in degradation of specific organic or inor- 
ganic substrates . 

As will be appreciated from the above, the present invention 
15 therefore is highly useful in prokaryotic systems. 

For the purposes of using the method of the invention in 
prokaryotic cells, it is preferred that the prokaryotic cells 
are bacteria selected from the group consisting of Bacillus 
spp. (e.g. £. anthracis, B. subtilis and B. cereus) , 

20 Clostridium spp. (e.g. C. botulinum, C. difficile , C. perfrin 
gens, and C. tetani) , Corynebacterium spp. (e.g. C. diphthe- 
riae, and C. pyogenes) , Staphylococcus spp. (e.g. S. aureus 
and S. albicans) , Streptococcus spp. (e.g. S. pneumoniae, S. 
pyogenes, and S. agalactiae) , Escherichia coli, Serratia 

25 marcescens, Klebsiella spp. (e.g. K. pneumoniae) , Proteus spp 
(e.g. P. mirabilis) , Citrobacter spp. (e.g. Citrobacter freun 
dii) , Salmonella spp. (e.g. S. typhi, S, typhimurium, S. 
shottmulleri and S. paratyphi) , Shigella spp. (e.g. S. 
dys ten teriae, S. flexneri, S. boydii, and S. sonnei) , Pseudo- 

30 monas spp. (e.g. P. aeruginosa , P. pseudoma llei , and P. mal- 
lei) , Acinetobacter spp., Aeromonas spp., Plesiomonas spp., 
Yersinia spp. (e.g. Y. pestis, Y. enterocol it ica , and Y. 
pseudotuberculosis) , Francisella tularensis, Vibrio spp. (e.g 
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V. choleras and V\ pdrjiwemolyticua) , Campylobacter spp. (e.g. 
C. jejuni and C. coli), Helicobacter pylori, Haemophilus spp. 
(e.g. //. inflnenzea, //. pa ra influenzae , and //. aeqypti us) , 
Bordetella spp. (e.g. B. pertussis, B . parapertussis, and B. 
5 i.ronr/ii.wptiM), /<rucei Z<> spp., Ncixsori.i spp. (e.g. A/, gonno- 
rhoeae and A/, /non ingi tidis) , Treponema pallidum, Leptospira 

i n te r rcoa n s ; Rnrr f W 7 .-l Qnn i ,~, ,< o i^,, „ vr.. . . ... 

' ' ^ 1 -'i-^t--- v v. . c, . ^. i/ij i. iJlkaI i c;i i seusu stricr.o, 

B. garinii, B. af::elii, and B. recurrent is) , Legionella 
pneumophila, Listeria monocytogenes, Mycobacterium spp. (e.g. 
10 M. tuberculosis, M. bovis, M. africanum, M. kansasii, and M.' 
leprae), Treponema pallidum, Chlamydia trachomatis, Actino- 
myces spp . , Ri cket tsia spp . , and My cop 1 a sma spp . (e.g. M. 
pneuinon iae) . 

This list of bacteria thus entails bacterial families and spe- 
15 cies which are involved in pathology of a large number of dis- 
eases in humans. Of these, E. coli and B . subtilis are also 
used for industrial fermentation; this is also the case for 
lactic acid bacteria, notably Lactococcus spp. and Lactobacil- 
lus spp. and therefore it is also preferred that the inventive 
20 methods, when employed in bacterial systems, are performed on 
such non-pathogenic species. 

When using the inventive methods in a prokaryotic system, it 
is very often interesting to identify those cells which are 
impaired in growth or lethally damaged due to the presence of 
25 the heterologous expression product introduced according to 
the invention. This, however, is not completely unproblematic 
since the main phenotypic trait associated with e.g. bacterial 
death is absence of the bacterium. 

It is therefore necessary to device an experimental setup 
30 which will allow identification of cells transformed so as to 
be less good survivors than other otherwise corresponding 
cells. One advantage is that expression of the heterologous 
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qenetic material is under the control of an inducible pro- 
moter. In this way it is possible to expand colonies of cells 
which have been transformed with genetic material which, when 
expressed, is lethal or growth- impair ing to the cells. After 
5 that, the expression can be switched on and careful examina- 
tion of expanded colonies should reveal those clonal colonies 
which do not follow the same growth pattern as e.g. an un- 
trans formed control . 

One method of doing this is to spread transformed cells on 

10 plates with growth medium and allow the cells to grow up to a 
pre-determined average size. The spreading of cells should be 
such that the visible colonies forming will generally be com- 
prised on one single clone of cells. When the pre-determined 
size of colonies have been reached, all plates are blotted to 

15 a carrier medium so as to prepare a "negative" of each agar 
plate. After this, the expression of the inserted random 
sequences is induced and the colonies are allowed to grow 
again. The plates are examined continuously or at suitable 
intervals (e.g. by means of digital image processing systems 

20 well known to the skilled person) . Those colonies which reveal 
an impaired or arrested growth compared to the remainder of 
the colonies or compared to controls are thereafter identi- 
fied, since the growth pattern of each colony in an automated 
manner can be followed. These colonies can then be identified 

25 and isolated from the "negative" blot and it is thereafter a 
relatively simple procedure to extract the transforming ge- 
netic material and determine the sequence thereof. 

In this context, one interesting option is to render 
antibiotic-resistant bacteria non-resistant. The growth medium 
30 either contains, or is during culturing enriched with, the 
antibiotic in question and the colonies which upon induction 
of expression can be demonstrated to be less drug resistant 
than controls are examined further. Also pure 
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bactericidal/bncteriostatic effects can be examined. In such 
an embodiment, the bacteria are e.g. cultured on a suitable 
growth medium. Those colonies wn.cn after induction of expres- 
sion shows evidence of reduced or arrested growth are examined 
5 further: It is expected that some of the bacterial cells will 
be demonstrated to carry genetic material encoding an expres- 

s ion product which int-pr^rt-Q t.t-j+-k i / v . L _ _ , . 

i^.-vtr.i. lt 'i Known) targets tor 

u 1 1 l i ud c lenai a qen t s . 

Another phenotypic trait of interest is of course superior ■ 
10 survival of cells. It is, when dealing with utilisable bacte- 
ria, of interest to identify targets which will increase the 
survival rate of the bacteria. For example, bacteria used in 
industrial fermentations normally can be lethally damaged as a 
consequence of their own uncontrolled production of heterolo- 

15 gous expression products. If genes or target molecules can be 
identified which have a positive effect on the survival of 
such bacteria, the economic potential is enormous, since a 
fermentation process will be rendered more economic (less need 
to startup of new fermentations). Similarly, bacteria used in 

20 e.g. waste water purification can be made more resistant 
against toxic agents in their environment. 

The experimental setup in this context is relatively simple: 
The transformed bacteria are simply subjected to the poten- 
tially lethal condition, and only colonies which exhibit a 
25 superior survival are isolated and examined (and that will 

typically be the colonies which are detectable). A setup like 
the above-described for identifying cell death should thus not 
be necessary . 

Finally, a large group of phenotypic traits to be examined are 
30 those which can be detected by e.g. biochemical or immunologi- 
cal means. It is expected that the method of the invention 
will allow for identification of systems in bacterial cells 
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which, when properly modulated, can render the bacteria more 
effective as producers in industrial fermentation. Such 
phenotypic traits could e.g. be changes in enzyme activity, 
changes in receptor density, changes in expression rate etc. 



5 Special considerations apply when the randomized expression 
product is fused to a fusion partner which decides the final 
location of the expression product. Signal sequences in 
prokaryotes are well-known in the art, but it should briefly 
be mentioned that membrane-anchoring signals are known, and it 

10 is also possible to export the expression product to the 

periplasmic space of bacteria. Finally, it is also possible to 
include secretion signals so ats to allow the isolation of the 
expression product from culture supernatant. However, in many 
cases it is of course most relevant to keep the expression 

15 product inside the prokaryotic cytoplasm. 



Use of the method in eukarvotic systems 



It is especially preferred that the inventive method utilises 
eukaryotic cells as the substantially identical cells in order 
to allow screening for active biomolecules . Hence, these 
20 eukaryotic cells can be fungal cells, protozoan cells, animal 
cells , and plant cells . 



As is the case for bacteria , a number of fungi are pathogens 
in mammals, and therefore the present technology will, in 
parallel with what has been described above concerning anti- 

25 bacterial agents, be useful for identifying antifungal agents 
by using pathogenic fungi as the substantial identical cells 
in the method. Furthermore, fungi (especially yeast strains), 
like bacteria, are also utilised in fermentation processes 
(e.g. in the wine and brewing industries), and the method can 

30 therefore also be utilised using such non-pathogenic fungi as 
the substantially identical cells which are transformed with 
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the vectors, whoreb y impr ove me n t s in these strain s c a n b e 
obtained . 

['referred exampLes of fungi .serving as the eukaryotie cell in 
t h e i. n v e nt L v o m e t h r- <. h ; are P 'p id o rm o p h y r . - ■ n s p p . , 7 1 r Z c h op Iiyton 
b spp>., Microsporum spp., Candida albicans, Phi I opho ra spp. r 
Coia; / d Z o idea iffl/n i t i s, ZZi a top id szna caps a la L uj/i , B i tis corny cos 
d o 1 7 n atiti d i s , I \a la 2 c . ' > c c ; Z ' i i o ido s b r a a i Z i o n s Z a , C ryr. ?t ococcus ; 
neoforvnans, Asporg.i Llur> spp., 5a ccha romyces corovrs iae, 
a.J yve rOin} r ces i a c r. i s , and Z \i cci a pa a t o ris. 

10 Unlike fungal cells, protozoan cells are: only relevant, as 

pathogens for human- and '"other mammals, although some protozo- 
an:-, form part of bi ocul t u res conducting biological waste water 
purification. The method of the invention is therefore contem- 
plated to be useful in identifying new targets for 

15 antiprotozoal! agents. In this context, the preferred protozoan 
cells used as the substantially identical cell in the methods 
of the invention are selected from the group) 'consisting of 
Giardia iambi ia , Trichomonas vaginalis, Dientamooba fragi lis , 
Trypanosoma spp., Leishmania spp., Entamoeba histolytica, 

20 Naegleria fowleri, Acanthamoeba castellani, Harmanella spp., 
Isospora belli, Cryptosporidium spp., Sarcocystis spp., 
Toxoplasma gondii, Plasmodium spp. (e.g. P. falciparum, P. 
vivax, P. malariae, P. knowlesi, and P. ovale), Babesia spp., 
and Balantidium coll. 

25 Also plant cells are according to the invention interesting as 
target eukaryotic cells. The plant cells can be any plant cell 
which can be subjected to genetic engineering techniques 
allowing for single cell expression and growth. Thus, cells 
derived from e.g. Nicotiana tabacum (tobacco plant), Arabidop- 

30 sis thaliana , Brass ica napus, Brassica juncea, Musa sp . (ba- 
nana plants), rice, and corn are examples of plant cells 
useful in the invention. The skilled person in the art of 



WO 00/05406 




PCT/DK99/0040S 



26 

plant genetic engineering will know to choose suitable plant 
cells in the appropriate stage of their life cycle, suitable 
vector systems as well as suitable transformation methods. A 
short summary is given here: 



5 As other organisms, plant cells can be transformed with 

foreign DNA. One strategy for plant transformation em- 
ploys Agrobacter ium tumefaciens, a naturally occurring 
plant pathogenic bacterium which contains a plasmid (the 
Ti plasmid) with the ability to enter plant cells and 

10 insert a portion of its genome into plant chromosomes. 

The Ti plasmid has been engineered to make it a vector 
for plant transformation by including sequences for re- 
plication in E. coli and Agrobacter ium, unique restric- 
tion sites for inserting foreign genes, and selectable 

15 markers. 



Unfortunately, Agrobacter ium is not very effective at 
transforming monocots, a large group of plants that in- 
cludes all of the agriculturally important cereals. Le- 
gumes, another important group of food crops, are also 

20 difficult to transform and regenerate with Agrobacter ium . 

Therefore, a second strategy for transforming plants has 
been developed involving the Gene Gun, where a gene is 
inserted into an expression vector and coated onto beads. 
The DNA-coated beads are then introduced by means of the 

25 gene gun into plant cells, where a small fraction are 

taken up and incorporated into the DNA. Individual plant 
cells, callus, regenerating shoots, and embryos are all 
suitable targets in this technique. 



30 



To determine whether cells have actually incorporated 
foreign DNA and become transgenic, reporter genes such as 
GUS and Luciferase genes are used. In any case, foreign 
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genes must be flanked by a plant promoter in order to be 
e x p r ossecl. 

It is preferred to use cells derived from animals. These cells 
c: a r l b e ma mm a 1 i a n col Is, a r t h r <: ;> p ► o d c e 1 1 s s u c h as in s e c t c e lis, 
5 avian cells, and piscine cells'. A number of reasons -.ran be 

listed for usinq such cell types which each require a relevant 
^. «... u i l r a n s r o r rn a tion < \ n d e x p r ession s y s t ems , g r o w t h concli- 
t ion s , e tc, a 1 1 e a s i 1 y d e 1 e r m i n e d and chose: n b y t h e s k i 1 1 e d 
p e r s o n . 1 1 55 u f f i c e s t o n o t e t h a t o . g . certain ins e c t s c a use - 

10 enormous problems in human society (due to their (direct damag- 
ing activities or due to their functions as vectors carrying 
infectious agents), and therefore the method of the invention 
would supplement in the attempts of controlling such insects. 
As for the mammals, birds and fish, a number of these are 

15 "important in agri- and aquacuiture, where disease control is 
o f inter e s t . 



According to the invention mammalian cells such as human 
cells/cell clones or human cell lines are most preferred. This 
is clue to the fact that a large number of diseases in humans 

20 and other mammals et iologically depend on molecular interac- 
tions in the living cell - the provision of drugs or lead 
compounds which interact in vivo with biomolecules which play 
a role in diseases is therefore of great interest. Preferred 
mammalian cells are Chinese hamster ovary (CHO) cells, VERO 

25 cells, HeLa cells, W138 cells, BHK cells, COS-7 293 cells, and 
MDCK cells, which are all well-known in the art. 



The candidate nucleic acids are hence introduced into eukaryo- 
tic cells as part of a vector to screen for modulators of 
target enzyme activity. By the term "introduced into 7 ' is 
30 herein meant that the nucleic acids enter the cells in a 

manner suitable for subsequent expression of the nucleic acid. 
The method of introduction is largely dictated by the targeted 
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cell type, cf. below. Exemplary, but non-limiting methods 
include CaP0 4 precipitation, liposome fusion, 1 ipof ect in®, 
electroporat ion, viral infection, etc. 

The randomly modified nucleic acids are preferably integrated 
5 into the host cell genome (e.g. by means of retroviral infec- 
tion of the host cell) , or may exist either transiently or 
stably in the cytoplasm (i.e. through the use of traditional 
plasmids utilizing standard regulatory sequences, selection 
markers , etc . ) . 

10 Currently, the most efficient gene transfer methodologies for 
mammalian cells harness the capacity of engineered viruses, 
such as retroviruses, to bypass natural cellular barriers to 
exogenous nucleic acid uptake. 

The vector is preferably selected from the group consisting of 
15 a retroviral vector, a vaccinia virus vector, an adenoviral 
vector, an adeno associated virus (AAV) vector, a herpes 
simplex virus (HSV) vector, an alpha virus vector, and a 
semliki forest virus vector. 



Retroviral transduction 



20 As many pharmaceut ically important screens require human or 
model mammalian cell targets, retroviral vectors capable of 
transfecting such targets are preferred. 

Therefore, the candidate nucleic acids are preferably part of 
a retroviral virion which infects the cells. Generally, infec- 
25 tion of the cells is straightforward with the application of 
the infection-enhancing reagent polybene. Infection can be 
optimized such that the cells predominantly express a single 
construct each, e.g. by using the ratio of virus particles to 
the number of cells. Alternatively, it is possible to "screen 
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out" ceils which have been infected with more than one .single 
virion, e.g. by quantitatively assessing a selection marker 
and only. The rate of infection is well-known to fed Lev; a 
bo i sson di st r i but ion . 



5 A preterred embod i ment of the invention where the substan- 
tia 1 1 y i d en t i c a 1 cells a r o e u k a r y o t i c t h u s c omp r i .< ^ e s t h a t step 

1) t rmsCecting suitable packaging cells with vector:; com- 
prising the randomly modi, fit?'! nucleotide sequences and , 

10 which are inteqratabl o in virions produced by said 

p a c k a g i n g c ells, 

2) culturing said trans feet ed packaging cells in a culture 
medium under conditions which feci lit ate- production by 
the packaging cells of virions containing the randomly 

15 modified nucleotide sequences, 

3) recovering and optionally concentrating said virions, and 

4) transducing said substantially identical cells with the 
vi r ions . 



Thus, preferably the candidate nucleic acids are introduced 
20 into the substantially identical cells using retroviral vec- 
tors. The use is well-known in the art of helper-defective 
packaging cell-lines which are capable of producing all neces- 
sary proteins (gag, pol, and env) required for packaging, 
processing, reverse transcription, and integration of recombi- 
25 nant genomes, cf. the below discussion of such cell lines. 

Those RNA molecules which have in cis a \J/ packaging signal are 
packaged into maturing virions. In eukaryotes, retroviruses 
are preferred for a number of reasons. First, their derivation 
is fairly easy. Second, unlike Adenovirus-mediated gene deli- 
30 very, expression from retroviruses is long-term (adenoviruses 
do not integrate) . Adeno-associated viruses have limited space 
for genes and regulatory units and there is some controversy 
as to their ability to integrate. Retroviruses therefore 
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currently provide the best compromise in terms of long-term 
expression, genomic flexibility and stable integration, among 
other features. The main advantage of retroviruses is that 
their integration into the host genome allows for their stable 
5 transmission through cell division. This ensures that in cell 
types which undergo multiple independent maturation steps, 
such as hematopoietic cell progression, the retrovirus con- 
struct will remain resident and continue to express. 



Preferred retroviral vectors include vectors derived from 
10 retrovirus selected from the group consisting of Avian 

Leukosis-Sarcoma Virus (ALSV) , Mammalian type C, Mammalian 
type B, and Lentivirus as well as vectors derived from MSCV 
(murine stem cell virus), modified MFG virus and pBABE, and 
optionally modified with heterologous cis-acting elements. 

15 In general, retroviral vectors should contain as few viral 

sequences as possible in order to minimize potential recombi- 
nation events. Only the sequences necessary for packaging, 
reverse transcription and integration should be retained, as 
well as the viral promoter, enhancer, and polyadenylat ion 

20 sequences. 

The library of random nucleotides can be generated in a retro- 
virus DNA construct backbone, as is generally described in 
e.g. WO 97/27212. Standard oligonucleotide synthesis generates 
the random portion of the candidate modulator, using tech- 

25 niques well known in the art (cf . Eckstein, Oligonucleotides 
and Analogues, A Practical Approach, IRL Press At Oxford 
University Press, 1991); oligonucleotide libraries may be 
commercially purchased. Libraries with up to 10 9 unique sequen- 
ces can be readily generated in such DNA backbones. After 

30 generation of the DNA library, the library is cloned into a 

first primer. The first primer serves as a "cassette" which is 
inserted into the retroviral construct. The first primer gene- 
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rally contains a number of elements, including for example, 
the required regulatory sequences (e.g. translation, tran- 
scription, promoters, etc), fusion partners and scaffold 
molecule (s) , restriction endonuclease (cloning and subcloning) 
5 sites, stop codons (preferably in all three reading frames), 
regions of complementarity for second strand priming (prefer- 
ably at the end of the stop codon region as minor deletions or 
insex Lions may occur in tne random region), etc. 

A second primer is then added, which generally consists of 
10 some or all of the complementarity region to prime the first 
primer and optional necessary sequences for a second unique 
restriction site for subcloning. DNA polymerase is added to 
make double-stranded oligonucleotides. The double-stranded 
oligonucleotides are cleaved with the appropriate subcloning 
15 restriction endonucleases and subcloned into the target retro- 
viral vectors, described below. 

in this manner the primers create a library of fragments, each 
containing a different random nucleotide sequence within a 
scaffold sequence derived from genetic material encoding a 
20 enzyme modulator. The ligation products are then transformed 
into bacteria, such as E . coli and DNA is prepared from the 
resulting library, as is generally outlined in Kitamura, PNAS 
U.S.A. 92: 9146-50 (1995), which is incorporated by reference 



25 Any number of suitable retroviral vectors may be used. Gene- 
rally, the retroviral vectors may include: selectable marker 
genes under the control of internal ribosome entry sites 
(IRES), which allows for bicistronic operons and thus greatly 
facilitates the selection of cells expressing peptides at 

30 uniformly high levels; and promoters driving expression of a 
second gene, placed in sense or anti-sense relative to the 5'- 
LTR (long terminal repeat) . Suitable selection genes include, 



herein . 
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but arc not limited to, neomycin, blastocidin , bleomycin, 
puromycin, and hygromycin resistance genes, as well as self- 
fluorescent markers such as green fluorescent protein, enzy- 
matic markers such as lacZ, and surface proteins such as CD8, 



The retroviruses may include inducible or constitutive promo- 
ters. For example, there are situations wherein it is neces- 
sary to induce peptide expression only during certain phases 
of the selection process. For instance, a scheme to provide .. 

10 pro-inflammatory cytokines in certain instances must include 
induced expression of the peptides. This is because there is 
some expectation that over-expressed pro-inflammatory drugs 
might in the long-term be detrimental to cell growth. Accor- 
dingly, in this situation constitutive expression is unde- 

15 sirable, and the peptide in only turned on during that phase 
of the selection process when the phenotype is required, and 
then the peptide is shut down by turning off the retroviral 
expression to confirm the effect or ensure long-term survival 
of the producer cells. A large number of both inducible and 

20 constitutive promoters are known to the skilled person. 

In addition, it is possible to configure a retroviral vector 
to allow inducible expression of retroviral inserts after 
integration of a single vector in target cells; importantly, 
the entire system is contained within the single retrovirus. 

25 Tet-inducible retroviruses have been designed incorporating 

the Sel f -Inactivating (SIN) feature of 3 ! LTR enhancer /promoter 
retroviral deletion mutant (Hoffmann et al., PNAS U.S.A. 
93:5185 (1996)). Expression of this vector in cells is virtu- 
ally undetectable in the presence of tetracycline or other 

30 active analogues. However, in the absence of Tet, expression 
is turned on to maximum within 48 hours after induction, with 
uniform increased expression of the whole population of cells 
that harbour the inducible retrovirus, indicating that expres- 
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sion is regulated uniformly within the infected cell popula- 
tion. A similar, related system uses a mutated Tet DNA-bindirig 
domain such that it bound DNA in the presence of Tet, and was 
removed in the absence of Tet. Either of these systems is 
5 suitable. 

According to the present invention, the most preferreci vectors 
V Li— i i.i.u.;w in Example i) are Daseci on the murine Akv retro- 
virus, a mammalian type C retrovirus (NCBI taxonomy Id 
#11791). The Akv virus has high homology with the Moloney 
10 retrovirus, commonly used in the field, A brief description of 
the design of these preferred vectors is as follows: 

The vectors contain a chimeric 5 f LTR, allowing expres- 
sion from the strong Cytomegalovirus (CMV) promoter when 
transcription is driven from the plasmid (as in transfec- 
15 tions) . Following integration of the vector into the host 

genome, transcription is driven from the retroviral LTR 
(as in transductions) . 

A versatile polylinker is present downstream of the pack- 
aging signal. This enables the insertion of peptide li- 
20 braries being part of a scaffold molecule in this posi- 

tion . 

Immediately downstream of the polylinker is an internal 
ribosomal entry site (IRES), derived from the 
encephalomyocarditis (EMC) virus or an internal promoter, 
25 originating from the SV-40 virus. This allows efficient 

translation from the downstream expression cassette ei- 
ther in a CAP independent (IRES), or in a CAP dependent 
(internal promoter) manner . 



30 



Several different marker genes have been cloned into the 
downstream expression cassette. For example antibiotics 
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resistance genes such as Neo and Hygro, fluorescent pro- 
teins such as EGFP, or surface proteins such as ANGFR. 
The- availability of vectors containing different markers 
allows for the selection of transduced cells using either 
5 drug treatment, flow cytometry or magnetic bead separa- 

t ion . 

Any scaffold protein or marker can be combined, either in 
the bicistronic vector or in a vector containing the SV- 
40 internal promoter. 

10 in view of the above, it is hence preferred that the retrovi- 
ral vector has non-identical ends so as to facilitate PCR- 
based generation of random DNA sequences. It is furthermore 
preferred that these non-identical ends contain non-identical 
promoters. An especially preferred retroviral vector contains 

15 a heterologous promoter replacing the viral promoter in the 
5 1 -LTR, such as a CMV promoter, an RSV promoter, an SV-40 
promoter, a TK promoter, an MT promoter, or an inducible 
system such as Tet or Ecdysone. 

Particularly well suited retroviral transfection systems 
20 (packaging cells) are PE501 (US 4,861,719), Bosc23 (WO 
94/19478), ^2 (R. Mulligan/D. Baltimore), GP+E86 (US 
5,278,056), PhoenixEco (WO 97/27212), PA317 (US 4,861,719), 
GP+AM12 (US 5,278,056), DA(ampho) (WO 95/10601, WO 92/05266), 
Bing (WO 94/19478), FLYA13 (WO 97/08330), ProPak (available 
25 from SyStemix), CRIP (R. Mulligan), W\M (R. Mulligan/D. Balti- 
more), Phoenix-Ampho (WO 97/27212), PG13 (Targeted Genetics), 
H9 (293GPG) (D. Ory, M Sadelain, R. Mulligan, J. Schaf fer) , 
and EcoPack (Clonetech) . 

Retroviral transduction is dependent upon the interaction 
30 between the virus envelope glycoproteins and host cell surface 
receptors. By far the two most commonly exploited receptors 
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for retroviral gone delivery are the ecotropic receptor (re- 
stricted to murine and rat cells) and the amphotropic receptor 
(widely d i s 1 r ibuted on both immortalized coll line:.; and on 
primary mamma 1 i an cells). A number o I 7 packaging cell. Lines, 
for generating either ecotropic or amphotropic viruses, ex- 
ists. In uddir.ion, packaging cell lines, which pseudotype 
retroviral part, ickas with either GALV ((ahbnri Ape Leukemia 
V i . 1 — a s I y .... v r LuL.-LH) ( i l V 3 v o [ v e s i c u 1 a r S t o i r i a t i t i s V i r u s G 
g.lycop rot ein ) have a 1 so recent 1 y been dove 1 oped . 

10 Until recently most packaging cell lines were based on NIH-3T3 
cells, sir si as tho and GPf86 (ecotropic) Lines and the 

PA317, G P f AM 1 2 and GRIP (amphotropic) lines. These have been 
used extensively and work well, particularly when stable 
producer Lines are made. However, NIH-3T3 based packaging cell 

15 lines 'jive relatively low titers when virus is generated from 
transient t rans feet ion . Over the past few years several pack- 
aging cell lines based on the highly t rans feet able 293 cell 
line (human embryonic: kidney cells) have been developed. These 
include the Bosc23 cell line (Pear et al., 1993) an ecotropic 

20 cell line which has been demonstrated to work very well within 
the boundaries of the present invention, as well as the Eco- 
Pack cell line from Cionetech. The advantage of 293 based 
lines is that very high titers (up to 10 7 infectious units/ml, 
lU/ml) of viral supernatant can be produced from transient 

25 transf ect ions in as little as 48 hours. In a library situation 
transient trans feet ions are preferred, as this gives all 
library members a chance of being equally well expressed, 
without any bias introduced by expression from different 
integration sites. 

30 Having access to well characterised stable packaging cell 

Lines is critical for gene therapy associated projects. For 
the purposes of the present invention, however, it is not 
necessary to employ such well defined lines, as the libraries 
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will always be produced from transient transf ect ions and no 
stable "producer line" will or need be established. An alter- 
native strategy for gaining entry into non-murine cells that 
has been explored by the present inventors is therefore to use 
5 a heterologous viral envelope glycoprotein to pseudotype 

viruses produced in ecotropic packaging cell lines, e.g. that 
from Vesicular Stomatitis Virus (the VSV G protein) . VSV G 
pseudotyping of retroviruses is interesting for two main 
reasons. First, the cell surface receptors for VSV G are 

10 ubiquitous membrane components, such as phosphatidylse r.ine and 
gangliosides . Pseudotyping with VSV G therefore confers broad 
tropism to the virions. Second, the VSV G protein is extremely 
stable once incorporated into the virions. This is important 
as it allows concentration of the viral supernatant by ultra- 

15 centrif ugat ion, a step which can increase the viral titers per 
volume unit by 10 to 100 fold. 

The VSV G protein is highly fusogenic and, as a consequence, 
exhibits cytotoxicity in tissue culture. It has therefore not 
been possible to establish packaging cells that express VSV G 

20 cons titutively . To circumvent this problem, inducible systems 
have been developed (Ory et al., 1996). However, because the 
CellScreen™ retroviral libraries are produced from transient 
transf ections , a very simple alternative is to transiently 
transfect VSV G, together with the library, into an ecotropic 

25 packaging cell line, such as Bosc23. This approach allows 
viral supernatants of broad tropism and high titers to be 
produced, before severe toxicity is observed in the culture. 
Using this method, a panel of non-murine target cell lines 
have been tested for transducability and titers of up to 10 ? 

30 IU/ ml of viral supernatants have routinely been achieved by 
the inventors. 



A recently reported alternative to pseudotyping with VSV G is 
to pseudotype with the envelope glycoprotein of Lymphocytic 
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Choriomeningitis Virus (LCMV) (cf . Miletic et a/., 1999, J. 
Virol. 73; 011*1-^11*3). This alternative is, also included as an 
embodiment r.f the present invention. 

After product ion in packaginq cells, concentration of virus 
5 may bo performed as follows: Generally, retroviruses are 
t i t r e d 1' y a c> p 1 yinq r e t rov i rus-c o n t a i n i n q s 1 1 p o r n a t a n t o n t o 
indioiLor certs, such as NlIT-3To cells, and then moasurinq the 
percentage of co L is expressing phenotypic consequences of 
infection. The concentration of the virus is determined by 

10 multiplying the percentage of cells infected by the dilution 
factor involved, and taking into account tne number of target 
cells available to obtain a relative titre. if the retrovirus 
cent si ins a reporter gene, such as lacZ, then infection, 
integration, and expression of the recombinant virus is mea- 

15 sured by histological staining for lacZ expression or by flow 
cytometry (FACS). In general, retroviral titres generated from 
even the best of the producer cells do not exceed 10 7 per ml, 
unless concentration is performed on relatively expensive or 
exotic apparatus. However, it is believed that particles as 

20 large as retrovirus will not move very far by means of brown- 
ian motion in liquid, fluid dynamics predictions show that 
much of the virus never comes in contact with the cells in 
order to initiate infection. However, if cells are grown or 
placed on a porous filter surface and retrovirus are allowed 

25 to pass the cells by gradual gravi tometric flow, a high con- 
centration of virus around cells can be effectively maintained 
at all times. Thus, up to a ten-fold higher infectivity by 
infecting cells on a porous membrane and allowing retrovirus 
supernatant to flow past them has been seen. This should allow 

30 titres of 10 9 after concentration. 



Upon isolation/concentration of virus, the substantially 
identical target cells are transduced by methods well-known in 
the art. 
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In applications when effector molecules with oncogenic poten- 
tial are present in the library it is important to use retro- 
viral supernat ant s which are non-infectious to humans. In 
these cases the ecotropic receptor can be stably introduced 
5 into the target cell of interest and viruses can be produced 
using ecotropic systems. The ecotropic receptor is a cationic 
amine* acid transporter protein (mCAT) , shown to be sufficient 
to confer susceptibility to ecotropic virus infection (Albrit- 
ton et al . 1989). Expression of this receptor in a variety of 

10 human cells, including lymphocytes, have been documented in-- 
the literature (Hitoshi et al. T 1998). The present inventors 
have demon st ra ted that int roduct ion of mCAT , both by stable 
trans feet ion and by transduction (using a retroviral vector 
encoding mCAT ) , yield target cells that are highly susceptible 

15 to infection by Bosc23 generated virions. 

Hence, the cell lines discussed above, and the other methods 
for producing retrovirus, are useful for production of virus 
by transient transf ect ion . The virus can be either used di- 
rectly or used to infect another retroviral producer cell so 
20 as to expand the library. 



Fusion partners 



As mentioned above, fusion of the expression product to at 
least one fusion partner which facilitates expression and/or 
purification/isolation and/or further stabilization of the 
25 expression product is often desired. 

In eukaryotes, the fusion partner is often a sorting signal or 
a targeting sequence. Such a sorting signal will be in the 
form of a signal patch or a signal peptide. The well-known 
function of a sorting signal is to effect export of an ex- 
30 pressed peptide into endoplasmic reticulum, into Golgi appara- 
tus, into lysosomes, into secretory vesicles, into mitochon- 
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Iria, into peroxisomes, or into the nucleus. Of course, also 
export to the membrane or out. of the cell are possibilities. 
Preferably, the sorting siqnal or targeting sequence is se- 
lect e ( i t r orn t h o g r o u p c o n s 1 s t i n g o 1 

a nuclear Localization signal ( N LS ) such as P ro- hy s-Lys- 
Lys-Arg-Lys-Va 1 (3V-10 large T antigen NLG), Ala-Arg-Arg- 
Ars-Arg-Pro (human rotinoio acid receptor-^ WLS) , Glu- 
G_l u-Vd L -G iii-Mnj-bys-A rg-G [ n - Lys -Leu (NFKh p r >0 ) , Glu-Glu- 
hy.-: -Arg-Lys-Arg-Thr-Tyr-G Lu (NFkB p65) , and Ala-Va 1-Lys- 
Arq-Pro-A 1 a -A 1 a-Thr- Ly s~ Lys -A la -G 1 y-Gln-Al a - Ly s-Ly s -Lys — 
Ly--. --Lou-Asp (Xenopus nuc loop 1 asm in NLS) ; 

a iiiembcarie anchoring sees-no* - such as those derived from 
CDo, ICAM-2, IL-8, GDI , and LEA- I, and a lipi dation se- 
quence such as a my r i s t y 1 a t i on or a palm i toy 1 a t ion se- 
quence ; 

a lysosomal sorting signal such as a lysosomal degrada- 
tion sequence, and a lysosomal membrane sequence 
a mitochondrial localization sequence such as a mitochon- 
drial matrix sequence, a mitochondrial inner membrane 
sequence, a mitochondrial int ermembrane space sequence, 
and a mitochondrial outer membrane sequence; an endoplas- 
mic reticulum localization sequence such as the sequence 
from calreticulin ( KDEL ) and the sequence from adenovirus 
E3/19K protein ( LULSRRS F 1 DEKKMP ) ; 

a peroxisome sequence such as the peroxisome matrix se- 
quence from Luciferase; 

a f arnesylation sequence such as LNPPDESGPGCMSCKCVLS; 

a geranylgeranylation sequence such as LTEPTQPTRNQCCSN ; 

a destruction sequence such as RTALGDIGN; and 

a secretory signal sequence such as the secretory signals 

from IL-2, growth hormone, preproinsulin, and influenza 

HA protein . 
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Enzyme modulators useful in the invention 



The art has demonstrated the existence of numerous effective 
peptide enzyme activity modulators. Especially the enzyme 
inhibitors are well-characterized. Non-limiting examples which 
5 are all incorporated by reference herein are listed in the 
following : 



BPTI/Kunltz family of protease Inhlhltors : 

Pancreatic trypsin inhibitor (BPTI) from Bos taurus; 
Spleen trypsin inhibitor from Bos taurus; 
10 Inter-alpha-t rypsin inhibitor light chain (bikunin) from Bos 
taurus,, Homo sapiens , Meriones unguiculatus , Mesocricetus 
auratus r Mus musculus, Sus scrofa f Pleuronectes platessa , and 
Rattus norvegicus , respectively; 

Inter-alpha- trypsin inhibitor from Equus caballus , Ovis aries, 
15 and Capra hircus, respectively; 

Hemolymph trypsin inhibitor A from Manduca sexta; 

Hemo lymph trypsin inhibitor B from Manduca sexta; 

Colostrum trypsin inhibitor from Bos taurus; 

Trypstatin from Rattus norvegicus ; 
20 Proteinase inhibitor from Tachypleus tridentatus; 

Serum basic protease inhibitor from Bos taurus; 

Chymotrypsin inhibitor SCI-III from Bombyx mori; 

Male accessory gland ser ine-prot ease inhibitor from Drosophila 
funebris; 

25 Protease inhibitor 5 II from Anemonia sulcata ; 

Chymotrypsin inhibitors SCI-I and SCI-II from Bombyx mori; 
Proteinase inhibitors SHPI and SHPI-2 from Stoichactis helian- 
thus; 

Isoinhibitor K from Helix pomatia; 
30 Trypsin inhibitor IV from Radianthus macrodactylus; 

Venom basic protease inhibitors IX and VIIIB from Bungarus 
fascia tus; 
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Venom basic protease inhibitors I and III from Vipera ammo- 
d y t o s a m m o dytes; 

Venom basic protease inhibitor II from Daboia russoili siamen- 
sis, Homachatus haemacha tus , and Naja n i vea, respectively; 
5 Venom basic protease inhibitors B and E from Dendroaspis 
pol yl cp:i s po lylepis; 

V ■ " n c m c h y m > t ryps m i n h i b iLur from Na j a n a j a ; 
Venom basic protease inhibitors I and K from Dendroaspis 
polyl epi s pol y I epi s ; 
10 Venom basic protease inhibitor K from Dendroaspis angusticeps 
Venom trypsin inhibitor from Eristocophis maomahoni and Naja 
naja , respect ively; 

Protease inhibitor from Sarcophaga bulla ta; 

Tissue factor pathway inhibitor from Homo sapiens, Oryctolagvs 
15 cuniculus, and Rattus norvegicus, respectively; 

Tissue factor pathway inhibitor 2 from Homo sapiens; 

Uterine plasmin/t rypsin inhibitor from Sus scrofa; 

Protease nexin II (fragment of Alzheimer's disease amyloid A4 

protein) from Homo sapiens, Mus musculus, rattus norvegicus, 
20 Macaca fascicularis and Macaca mulatta, respectively; 

Amyloid protein 2 from Homo sapiens and Rattus norvegicus, 

respectively; and 

Ornithodorin from Ornithodoros moubata, 

as well as inhibitors homologous therewith isolated from other 
25 sources than those explicitly mentioned. 

Serpln family of pxroteasG Inhibitors: 

Alpha-l-proteinase inhibitors ( alpha-l-anti trypsins ) from Equus 
caballus, Mus musculus, Cavia porcellus, Oryctolagus ounicu- 
lus, Bos taurus, Chinchilla villidera, Didelphis marsurpiales 
30 virginiana , Homo sapiens, Macropus eugenii, Mus caroli, Papio 
anubis, Sus scrofa, Rattus norvegicus , and Ovis aries, respec- 
tively; 

Alpha-l-ant ichymot rypsin from Homo sapiens; 
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Anti thrombin III from Homo sapiens, Bos taurus, Mus musculus , 
Ovis aries, Mesocricetus auratus, and Gallus gallus, respec- 
tively; 

Alpha-2-plasmin inhibitor ( alpha-2-antiplasmin ) from Bos 
5 taurus, Homo sapiens , and Mus musculus r respectively; 

Bomapin (Protease Inhibitor 10) from Homo sapiens ; 

Contrapsin from Mus musculus and Cavia porcel lus ; 

Cont rapsin-like protease inhibitors from Rattus norvegicus; 

Factor Xlla inhibitor from Bos taurus; 
10 Glia derived nexin (protease nexin I) from Homo sapiens, Mus 

musculus , and Rattus norvegicus , respectively; 

Heparin co-factor II from Homo sapiens , Mus musculus , 

Oryctolagus cuniculus , and Rattus norvegicus , respectively; 

47 kDa heat shock protein (serine protease inhibitor J6) from 
15 Mus musculus and Gallus gallus, respectively; 

Cl-inhibitor from Homo sapiens ; 

Leukocyte elastase inhibitor from Equus caballus , Homo sapi- 
ens, and Sus scrofa, respectively; 
Protein C inhibitor from Homo sapiens; 
20 Kallistatin from Homo sapiens ; 

Kallikrein-binding protein from Mus musculus and .Rattus 
norvegi cus, respectively; 

Maspin from Homo sapiens , Mus musculus , and Rattus norvegicus , 
respectively; 

25 Plasminogen activator inhibitor-1 from Bos taurus, Homo sapi- 
ens, Mus musculus , Mustela vison, and Rattus norvegicus r 
respectively; 

Plasminogen activator inhibitor-2 from Homo sapiens , Mus 
musculus, and Rattus norvegicus, respectively; 
30 Neuroserpin from Homo sapiens , Mus musculus , and Gallus gallus 
Cytoplasmic ant iprot einases 1, 2 and 3 from Homo sapiens 
Antitrypsin from Bombyx mori, respectively; 
Ant ichymotrypsins I and II from Bombyx mori; 
Alaserpin from Manduca sexta; 
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Serine protease inhibitor 2.1 from Rattus norvegicus; 
Serine proteinase inhibitor 1 from Cowpox virus, Rabbit pox 
virus, Swine pox virus, Vaccinia virus, and Variola virus, 
respect i vol y ; 

5 Serine proteinase inhibitor 2 from Sabbit pox virus, Vaccinia 
virus, and Variola vi rus, respectively; 

S e r i n o p r o tein a s e i n h i b 1 1 o r 3 f r on V a c e i r i i a v i r u s a n d V a r i o 1 a 

V i. .r U S , i_ ^ .i- ^j'z u l _l y ; 

Serpin 1 from Myxoma virus; 
10 Serine proteinase inhibitor from Halocynthla roretzl; 

Ice inhibitor from Cowpox virus ( th i ol -protease inhibitor); 

as well as inhibitors horn- doqous therewith isolated from other 

sources than those oxpl i c i 1 1 y rnent ioned . 

Kazal fam i ly of protease Inhibitors : 

15 Acrosin inhibitors from Bos taurus, Homo sapiens, Sus scrofa, 

and Macaca fasc icularis , respectively; 

Elastase inhibitor from Anemonia sulcata ; 

Ovoinhibi tor from Callus aallus; 

Rhodniin from Rhodnius prollxus; 
20 Pancreatic secretory trypsin inhibitor from Rattus norvegicus , 

Anguilla anguilla, Bos taurus, Canls famlllaris, Homo sapiens, 

Sus scrofa and Ovois aries, respectively; 

Pancreatic secretory trypsin inhibitor II from Rattus norvegi- 
cus; 

25 Double-headed protease inhibitor (submandibular gland) from 
Canls famlllaris , Fells silvestris catus, Meles meles, Pan- 
thera leo, and Vulpes vulpes, respectively; 
Trypsin inhibitor from Halocynthla roretzl; 
Tryptase inhibitor from Hlrudo medicinalis; 

30 Prostatic secretory glycoprotein from Mus musculus ; 

Ovomucoid (third domain) from Abux-ria plplle, Aepypodlus 
arfaklanus, Afropavo congensis , Alectorls chukar, Alectorls 
rufa, Anas platyrhynchos , Chloephaga plcta, Cyanochen cyanop- 



WO 00/05406 




44 

tera, Neochen jubata, Tadorna radjah, LophonetLa speculario- 
ides, Anas capensis , Aix galericulata , Aix sponsa , Sarkidior- 
nis melanotos , Alopochen aegyptiaca, Mergus cucullatus , Anhin- 
ga novaehollandiae , Anser anser anser, Anser indicus , Ansera- 
5 nas semipalmata , Arborophila torqueola , Argusianus argus, 

Ay thy a americana , Netta rufina, Balearica pavonina , Bambusi- 
cola thoracica , Bonasa umbellus, Branta canadensis , Anser 
canagicus , Callipepla squama ta castanogastric , Callipepla 
squama ta pallida , Carpodacus mexicanus , Carpococcyx renauldi , 

10 Casuarius casuarius, Casuarius bennetti, Cereopsis 

novaehollandiae , Chauna chavaria , Chauna torquata , Gallus 
gallus, Chrysolophus amherstiae , Chrysolophus pictus, Circus 
aeruginosus , Colinus virginianus , Corvus albus, Corvus monedu- 
la , Coscoroba coscoroba, Coturnix delegorguei , Coturnix cotur- 

15 nix j aponica , Crossoptilon crossopti Ion , Cygnus atratus , 
Cygnus olor, Oxyura j amaicensis , Oxyura vittata , Cyrtonyx 
montezumae , Dacelo novaeguineae , Dendrocygna arborea , 
Dendrocygna arcuata , Dendrocygna autumnalis , Dendrocygna 
bicolor, Dendrocygna eytoni , Dendrocygna viduata , Dromaius 

20 novae-hollandiae r Eudromia elegans , Francolinus afer coqui, 
Francolinus erckelii , Francolinus francolinus , Francolinus 
pondicerianus , Fulica atra, Gallinula chloropus , Gallirallus 
australis , Gallus varius, Geococcyx californianus, Grus 
carunculatus , Grus japonensis , Grus vipio, Anthropoides virgo, 

25 Guira guira, Guttera pucherani , Gyps coprotheres, Polyboroides 
radiatus r Aquila audax, Necrosyrtes monachus , Haliaeetus albi- 
cilia, Haliastur Indus, Larus ridibundus , Larus marinus, 
Vanellus spinosus, Leipoa ocellata, Lophura bulweri, Lophortyx 
calif ornica , Lophortyx gambelii , Lophura ignita, Lophura 

30 diardi, Lophura leucomelana f Megapodius freycinet, Meleagris 
gallopavo , Agriocharis ocellata , Nothoprocta cinerascens , 
Nothoprocta perdicaria , Numida meleagris , Aery Ilium vulturi- 
num , Nycticorax nycticorax , Opisthocomus hoazin, Oreortyx 
pictus, Ortalis vetula, Pavo crista tus , Favo muticus, Penelope 
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gacquacu, Pcnolopo superciliaris, Pcrdrix pcrdrix, Phas ianus 
co I ch i cus col ch i cus , Phas i anus versa color , Pha 1 ac rocorax 
su I c i ros t r i s , I'oda rgus strigoi dec , Pcd. ypl oc t. con bi c,i 1 ca ra t urn , 
Po lyp Loot von oinphanum, Polyborus plancus, PygoscoL is ado lie, 
5 Pha Licr^corax a 1 b i 'venter , Rhea amoricana, Pt erocnom ia pennata, 
Rhynchotus rut oscons , Rol 1 ulus roulroul , Scyth rops 
novaoho] landiao, Sphoniscus humboldt.i , Strut hio cimolus, 
Syrmaticus mikado, Syrmaticus reeves j i, Tinamus major, Turdus 
morula , Turnix sylvat ica , Tympanuchus cupido r Controcercus 

10 urophas lanus , Traqopan blythii , Tragopan canoti , Tragopan 

sa t yra , Tragopan t >: mini i nek i i , Lophophorus impej anus , Crossopti- 
lon auci tiun, Crosr>opti Ion ma ntchu r icnnp Lophura edwardsi , 
Lophura nycthemora f Lophura swinhoei f Pucrasia macrolopha , 
Catrous wall ichi i , Syrmaticus ollioti, Syrmaticus humiae, 

15 Syrmaticus soemmer r ingii , Lagopus leucurus , and Vultur gry- 
ph us, respectively, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Soybean trypsin Inhibitor (Kxuxltz) family of protease inhibi- 
2 0 tors : 

Aspartic proteinase inhibitor from Solanum tuberosum; 
Cathepsin D inhibitors from Solanum tuberosum; 
Wound-induced aspartate proteinase inhibitor from Solanum 
t uberosum; 

25 Chymotrypsin inhibitor 3 precursor from Psophocarpus 
totra gonolobus ; 

Trypsin inhibitor from Adenanthera pavonina ; 
Trypsin inhibitor from Prosopsis juli flora; 
Trypsin inhibitor from Erythrina caffra; 
30 Trypsin inhibitor from Erythrina latissima; 

Chymotrypsin inhibitor from Erythrina varicgata; 
Trypsin inhibitors 1A, IB, and 2 from Psophocarpus 
tet ra gonolobus ; 
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Trypsin/chymot rypsin inhibitor from Alocasia macrorrhiza ; 
Trypsin inhibitor from Albizzia julibrissin; 
Trypsin inhibitor from Acacia confusa; 
Trypsin inhibitors A, B and C from Glycine max; 
5 Trypsin inhibitors KTIland KTI2 from Glycine max; 

Latex serine proteinase inhibitor from Carica papaya; 
Cysteine proteinase inhibitor PCPI 8.3 from Solanum tuberosum; 
and 

Kunitz-type inhibitors 1 and 2 (PKI-1) from Solanum tuberosum, 
10 as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned 



Potato inhibitor I family (protease inhibitors) : 

Trypsin/subtilisin inhibitor from Amaranthus caudatus ; 
Subtilisin inhibitor from Momordica charantia ; 
15 Wound-induced proteinase inhibitor I from Lycopersicon 

esculentum , Lycopersicon peruvianum, and Solanum tuberosum, 
respectively; 

Subtilisin inhibitors I and II from Phaseolus angular is; 

Proteinase inhibitor I from Solanum tuberosum; 
20 Chymotrypsin inhibitor 2A (CI-2A) from Hordeum vulgare; 

Chymotrypsin inhibitor 2B (CI-2B) from Hordeum vulgare; 

Chymotrypsin inhibitor 1A (CI-1A) from Hordeum vulgare; 

Chymotrypsin inhibitor IB (CI-1B) from Hordeum vulgare; 

Chymotrypsin inhibitor 1C (CI-1C) from Hordeum vulgare; 
25 Chymotrypsin inhibitor I, a, b and c subunits from Solanum 

tuberosum; 

Subtilisin inhibitor from Vicia faba; 

Ethylene-responsive proteinase inhibitor from Lycopersicon 
esculentum; 

30 Proteinase inhibitors I-A and I-B from Nicotiana tabacum; 
Eglin C from Hirudo medicinal is; 

Inhibitor of trypsin and Hageman factor from Cucurbita maxima; 
Trypsin inhibitor MCI-3 from Momordica charantia ; and 
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Trypsin inhibitor I from Nicotiana sylvestris , 

as well as inhibitors homologous therewith isolated from other 
sources than those exp Licit iy mentioned. 

Bovmian-Birk family of protease Inhibitors : 

5 Bowman-birk type protease inhibitors from Arachis hypogaea, 

~<_. ..^wx^^.u , ^i^";t'iu^. uinjuici ris , Glycine max, Triticum 

aestivum, Arachis hypogaea, Phaseolus vul gar is, Setaria itali- 
ca, Dolichos axillaris, Lonchocarpus capassa, Oryza sativa, 
Hordeum vulgare, Medicago scutellata, Phaseolus aureus, 
10 Phaseolus lunatus, Vicia angusti folia , Vicia faha, and Vigna 
unguioula ta , respect! vely; and 

Wound induced trypsin inhibitors from Medicago sativa and Zea 
mays, respectively, 

as well as inhibitors homologous therewith isolated from other 
15 sources than those explicitly mentioned. 

Squash Inhibitor family ; 

Elastase inhibitor from Momordica charantia; 

Trypsin inhibitors I, II, and III from Lagenaria leucantha; 

Trypsin inhibitor I from Citrullus vulgaris; 
20 Trypsin inhibitors I, III, and IV from Cucurbita maxima; 

Trypsin inhibitors I and II from Luff a cylindrica; 

Trypsin inhibitors I and II from Momordica charantia; 

Trypsin inhibitor I from Momordica repens; 

Trypsin inhibitor II from Bryonia dioica; 
25 Trypsin inhibitors IIB and IV from Cucumis sativus; 

Trypsin inhibitor II from Echallium elaterium; 

Trypsin inhibitors I, II and III from Cucumis melo var. Cono- 
mon; 

Trypsin inhibitors II and III from Cucurbita pe P o; and 
30 Trypsin inhibitor A from Momordica charantia, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 
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Streptomyces subtilisin Inhibitor (SSI) family: 

Subtilisin inhibitor from Streptomyces a lbogriseolus ; 
Subtilisin inhibitor-like protein-1 from Stroptomyces cacaoi; 
Trypsin inhibitor STI2 from Streptomyces longisporus; 
5 Subtilisin inhibitor-like protein-2 from Stroptomyces rochei; 
Subtilisin inhibitor-like protein-3 from Streptomyces coeli- 
color; 

Subtilisin inhibitor-like protein-4 from Streptomyces laven- 
dulae; 

10 Subtilisin inhibitor-like protein-8 from Streptomyces virgin- 
iae; 

Protease inhibitor S1L-V3 from Streptoverticillium eurocidi- 
cus; 

Protease inhibitor SIL-V1 /SIL-V4 from Streptoverticillium 
15 flavopersicus; 

Protease inhibitor SIL-V5 from Streptoverticillium luteoverti- 
cillatus; 

Protease inhibitor SIL-V2 from Streptoverticillium orinoci; 
Alkaline protease inhibitor 2C 1 from Streptomyces griseoincar- 
20 natus; 

Protease inhibitor from Streptomyces lividans ; and 
Plasminostreptin from Streptomyces antifibrinolyticus, 
as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

2 5 Bombyx family of protease inhibitors 

Fungal protease inhibitor F (FPI-F) from Bombyx mori r as well 
as inhibitors homologous therewith isolated from other sources 
than Bombyx mori . 
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Wap-type r Four- disulfide Core* Proteinase Inhibitors 

Ant i leukoproteinase 1 from Homo sapiens and Mus musculus, re- 
s p e • • t i v e 1 y ; 

Elafin from Homo sapiens and Sus scrofa, respectively; and 
5 Ghelonianin (basic protease Inhibitor) from red sea turtle, 

as woLl as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned, 

Hirudin family of protease inhibitors 

Hirudins from Hirudo medicinalis and Hirudinaria manillensis , 
10 respectively, as well as inhibitors homologous therewith iso- 
late. J from ether sources than those explicitly mentioned. 

Factor Xa Inhibitors 

Antistasin from Ha emen teria officinalis, Haementeria ghili- 
anii, Hirudo medicinalis, Hirudo nipponia, and Hydra 
15 magnipapillata t respectively, as well as inhibitors homologous 
therewith isolated from other sources than those explicitly 
mentioned . 



As carls trypsin Inhibitor family 

Chymotrypsin/elastase inhibitors from Ascaris suum and 
Trypsin inhibitor from Ascaris suum as well as inhibitors 
homologous therewith isolated from other sources than Ascaris 
suum . 



Cy stat In family of protease Inhibitors 

Leukocyte cysteine proteinase inhibitors 1 and 2 from Sus 
25 scrofa; 

Stefins 1, 2 and 3 from Mus musculus; 
Cystatins Al, A5, A8 and B from Sus scrofa; 
Cystatins A and B from Bos taurus; 
Stefin C from Bos taurus; 
30 Cystatin B from Ovis aries; 
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Cystatins A, C, 0, M, SN, S and SA from Homo sapiens ; 
Cystatins B and C from Mus musculus ; 
Cystatins C and 5 from Rattus norvegicus ; 

Cystatins C from Macaca mulatta and Saimiri sciureus f respec- 
5 tively; 

Cystatin alpha (epidermal thiol proteinase inhibitor) from 
Ra ttus norvegi cus; 

Cystatin B (liver thiol proteinase inhibitor) from Homo sapi- 
ens and Rattus norvegicus, respectively; 
10 Cystatin (colostrum thiol proteinase inhibitor) from Bos 
taurus; 

Cystatins from Callus gallus and Coturnix coturnix j aponica , 
respectively; 

Cystatin from Cyprinus carpio; 
15 Cystatin from Bitis arientas; 

Cystatin 1 from Zea mays; 

Ory zacystat in- I from Oryza sativa; 

Oryzacystatin-I I from Oryza sativa; 

Cystatin A from Helianthus annuus; 
20 Cystatin B from Helianthus annuus; 

Cystatin from Vigna unguiculata; 

Onchocystat in from Onchocerca volvulus; 

Cysteine proteinase inhibitor from Solanum tuberosum; 
Cystatin WCPI-3 from Wisteria floribunda; 
25 Cystatin from Glycine max; and 

Kininogens from Bos taurus, Homo sapiens and Rattus norvegi- 
cus , respectively, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned . 

30 Ca.lpa.ln family of cysteine protease Inhlhltozrs 

Calpastatin from Bos taurus, Cercopithecus aethiops , Homo 
sapiens , Mus musculus , Sus scrofa, Oryctolagus cuniculus , 
Rattus norvegicus, and Ovis aries f respectively^ 
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as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Tissue Inhibitor of metalloprotelnases family 

Met a 1 loprote inase inhi tutor 1 from Bos taurus , Homn sapiens , 
5 Mas muscu I us , Papio cynocopha Lus, Bus scrofa, Oryciolaqus 
cun i cu.l us , Ra t 1 us norvoqicus , and Ov i s a n es, respect i ve ly ; 
h cLdi r ( : ' p r o i: e x 1 1 a s t :■ i n h _t O 1 1 or A Irom Bo s taurus , 11 oin o sap i e n s , 
Mus muscu lus , Rattus novvc-qicus, and Callus qa 1 lus>, respec- 
t i ve 1 y ; and 

10 Me ta i_l oproteinasc inhibitor 3 from Bos taurus, Homo sapiens, 
Mus muscu lus , Rat tus norvoqicus , and Ca 1 lus qalius r respec- 
t ive ly , 

as well as inhibitors homologous therewith .isolated from other 
sources than those explicitly mentioned. 

15 CarboxypeptlcLa.se A Inhibitors 

Carboxypept idase A inhibitor from Ascaris suum, as well as 
inhibitors homologous therewith isolated from other sources 
than As car is suum. 

Meta.llocarboxypeptlda.se Inhlbl tors 

20 Metallocarboxypeptidase inhibitors from Lycopersicon osculen- 
tum and Solanum tuberosum, respectively, as well as inhibitors 
homologous therewith isolated from other sources than those 
explicitly mentioned . 

Angiotensin- converting enzyme Inhibitors 

25 Angiotensin-converting enzyme inhibitor from Thunnus albaca- 
res; 

Angiotensin-converting enzyme inhibitors from Bothrops insula- 
ris; 

Angiotensin-converting enzyme inhibitors from Bothrops jarara- 



3 0 ca; 
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Angiotensin-convcrtinq enzyme inhibitor from Agkistrodon halys 
blomhoffi ; 

Angiotensin-convert ing enzyme inhibitor from Agkistrodon halys 
pallas; and 

5 Angiotensin-convert ing enzyme inhibitor from Vipera aspis, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 



Protease Inhibitors not belonging to a specific family 

Ecotin from Escherichia coli; 
10 Metalloproteinase inhibitor from Streptomyces nigrescens; 

Proteinase inhibitor from Erwinia chrysanthemi ; 

Proteinase inhibitor from Pseudomonas aeruginosa ; 

Protease A inhibitor 3 from Saccha romyces cerevisiae; 

Protease B inhibitors 1 and 2 from Saccharomyces cerevisiae; 
15 Major pepsin inhibitor PI-3 from Ascaris suum; 

Intracellular proteinase inhibitor from Bacillus subtilis; 

Proteinase inhibitor PTI from Solanum tuberosum; 

Proteinase inhibitor PCI-I from Solanum tuberosum; 

Proteinase inhibitor IIA from Solanum tuberosum; 
20 Proteinase inhibitor IIB from Solanum tuberosum; 

Proteinase inhibitors A and B from Sagittaria sagitti folia ; 

Proteinase inhibitor from Solanum melongena; 

Trypsin inhibitor from Brassica napus; 

Trypsin inhibitor 2 from Sinapis alba; 
25 Trypsin inhibitor from Zea mays; 

Trypsin inhibitor from Sinapis arvensis; 

Trypsin inhibitor from Trichosanthes kirilowii ; 

Wound-induced proteinase inhibitor II from Lycopersicon 

esculentum; 

30 Protease inhibitors LCMI I and PMP-D2 from Locusta migratoria; 
Protease inhibitor from Bacillus brevis; 

Marinostatins C-l, C-2 and D from Alteromonas sp. ; and 
Host protease inhibitor from bacteriophage T4 , 
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as well as inhibitors homoloqous therewith isolated from other 
s our o ( > .< ; 1 1 1 a n t h o s e o x p 1 i c i f 1 y me n t i o n e d . 

Cereal alpha- amylase/ trypsin inhlhltor family 

A Lpha amy La so / tryps i n inhi In t or CM 1 f mm Tri t i cum a as ti vum; 
5 Alpha amy laso/t rypsin inh i 1;. i tor CM/ 1 r<>m Tr iticum a est i vum; 

Alpha amy 1 a so / tryps i n i nhi bi b~»r CM 3 fr-m Tri ticum aesti vum; 

orpna amy _i aso/ trypsin inhibit- a: < 'M I 0 from Triticum aest ivum; 

Alpha amylase/trypsin inhib i t - a: CM 17 from Tri ticum aest ivum; 

A 1 p h a amy .1 a s e i n h i b i t o r 0 .19 t" r om Tr i 1 1 c urn a e s t i v urn ; 
10 Alpha amylase inhibitor 0.2 8 from Tr iticum aestivum; 

Alpha amy 1 as. • inhibitor 0.53 from Triticum aestivum; 

Alpha amylase inhibitor WDAI-? from Triticum aestivum; 

Alpha amylase/trypsin inhibitor CMA from Hordeum vuigare; 

Alpha amylase/trypsin inhibitor CMB from Hordeum vulgare; 
15 Alpha amylase/trypsin inhibitor CMC from Hordeum vulgare; 

Alpha amylase/trypsin inhibitor CMD from Hordeum vulgare; 

Alpha amylase inhibitor CME from Hordeum vulgare; 

Alpha amylase inhibitor BMAI-1 from Hordeum vulgare; 

Alpha amylase inhibitor BDAI-1 from Hordeum vulgare; 
20 Alpha amylase/trypsin inhibitor from Eleusine coracana ; and 

Trypsin/f actor XI la inhibitor from Zea mays, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

Alpha-amylase/ trypsin Inhlhltor homologous to thaumatln 

25 Alpha-amylase/trypsin inhibitor from Zea mays as well as 

inhibitors homologous therewith isolated from other sources 
than Zea mays. 

Alpha-amylase/suhtlllsln Inhlhltor family 

Alpha-amylase/subtilisin inhibitor from Hordeum vulgare; 
30 Alpha-amylase/subt ilisin inhibitor from Triticum aestivum; and 
Alpha-amylase/subtilisin inhibitor from Oryzae sative f 
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as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 



Inhibitors of insect alpha-amylases 

Small protein inhibitor of insect alpha-amy lases 1, 2 and 3 
5 from Sorghum bicolor milo as well as inhibitors homologous 
therewith isolated from other sources than Sorghum bicolor 
milo. 



Inhibitors of mammalian alpha- amylases derived from Streptomy- 
ces species 

10 Alpha-amylase inhibitor HAIM I from Streptomyces griseospo- 
reus ; 

Alpha-amylase inhibitor PAIM I from Streptomyces ol ivaceoviri- 
dis; 

Alpha-amylase inhibitor HAIM II from Streptomyces griseospo- 
15 reus; 

Alpha-amylase inhibitor PAIM II from Streptomyces 
ol i va ceovi ri di s ; 

Alpha-amylase inhibitor AI-3688 from Streptomyces aureofaci- 
ens; 

20 Alpha-amylase inhibitor Z-2685 from Streptomyces rochei; and 
Alpha-amylase inhibitor HOE-4 67A from Streptomyces tendae, 
as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 



Trehalase inhibitors 

25 Trehalase inhibitor from Periplaneta americana as well as 

inhibitors homologous therewith isolated from other sources. 



Polygalacturonase inhibi tors 

Polygalacturonase inhibitors from Phaseolus vulgaris and Pyrus 
communis as well as inhibitors homologous therewith isolated 
30 from other sources than those explicitly mentioned. 
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Fucosyl transferase inhlbi tars 

Fuctinins from Rattus norvegicus as well as inhibitors homolo- 
c j o u s t: h o r o w i. t h i. s o 1 a t o c i from o t h or so u r c: e s . 

Protein kinase C Inhibitors 

5 1-1-3-3 Protein beta/alpha from Bos taurus, Ovis <iries, Homo 
sapiens, and Rattus no rvogicus , respect ivoly; 

Jei-j.~^ Protein epsiion from Mus musculus , Ovis aries, Homo 
sapi ens , < : md Ra 1 1 us norvegi cus , respect i ve 1 y ; 

14-3-3 Protein eta from Bos taurus , Mus musculus , and Rattus 
10 no rveqi cus , respectively; 

1-1-3-3 Protein oarnma from Bos taurus, Ovis aries, and Rattus 
n o rve g i c u s , r es p e c t i v e 1 y ; 

1-1-3-3 Protein zeta/cielta from Bos taurus, Mus musculus, Ovis 
a vies, Homo sapiens, and Rattus norvegicus , respectively; 
15 Hint protein from Bos taurus, Rattus norvegicus , Homo sapiens, 
Mus musculus r and Oryctolagus cuniculus , respectively; and 
14 kDa zinc-binding protein from Brass ica juncea and Zea mays, 
respectively, 

as well as inhibitors homologous therewith isolated from other 
20 sources than those explicitly mentioned. 

cAMP- dependent protein kinase inhihi tors 

cAMP-dependent protein kinase inhibitor (muscle/brain form) 
from Homo sapiens / Oryctolagus cuniculus , Mus musculus , and 
Rattus norvegicus f respectively, and 
25 cAMP-dependent protein kinase inhibitor (testis form) from Mus 
musculus, and Rattus norvegicus, respectively, 

as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 
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Cyclic nucleotide phosphodiesterase Inhlhl tor 

Cyclic nucleotide phosphodiesterase inhibitor from 
Dictyostcl ium discoideum as well as inhibitors homologous 
therewith isolated from other sources. 

5 Protein phosphatase inhlhl tors 

Protein phosphatase inhibitor 1 from Homo sapiens , Oryctolagus 
cuniculus , and Rattus norvegicus, respectively; 

Protein phosphatase inhibitor 2 from Homo sapiens , Oryctolagus 
cuniculus , and Rattus norvegicus r respectively; 
10 Heat-stable protein phosphatase 2a inhibitor I1PP2A from Bos 
taurus and Homo sapiens , respectively; and 
Phosphatase RAPA inhibitor from Bacillus subt ills , 
as well as inhibitors homologous therewith isolated from other 
sources than those explicitly mentioned. 

15 TCD/MRS6 ±amlly of GDP dissociation inhibitors 

Secretory pathway GDP dissociation inhibitor from Saccharomy- 
ces cerevisiae; 

Rab GDP dissociation inhibitor alpha from Bos taurus, Homo 
sapiens , Mus musculus , and Rattus norvegicus , respectively; 
20 Rab GDP dissociation inhibitor beta from Homo sapiens , Mus 
musculus , and Rattus norvegicus , respectively; 

Rho GDP-dissociation inhibitor 1 from Bos taurus, Homo sapi- 
ens, Caenorhabditis elegans and Cavia porcellus , respectively; 
Rho GDP-dissociation inhibitor from Saccharomyces cerevisiae ; 
25 Rho GDP-dissociation inhibitor 2 from Homo sapiens and Mus 
musculus r respectively; and 

Rho GDP-dissociation inhibitor from Mus musculus , 

as well as inhibitors homologous therewith isolated from other 

sources than those explicitly mentioned. 
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ATP as e Inhibitors 

Mitochondrial ATPase inhibitors from Bos taurus , Caenorhah- 
ditir* eleijans, Pichia j id m i : , Sus scrofa, Rattus norveglcus , 
and Saccha rcvnyces cerev isi at >, respect i vely , and 
[ 3 S. .v i ium/potass iuin ATPase inh i b i. tor f rom Sus scro t a , 

as well as inhibitors homologous therewith isolated from other 
sources than those oxpl icitly mentioned. 

Phosphollpase A2 inhlhltory proteins 

Annex in I from Bos taurus, Homo sapiens , Mus musculus , Sus 
.10 scrofa, Rattus norvegicus, Oryctolagus cuniculus , Cavia cut- 
leri, Callus galdus, and Co Luniba livia, respectively; 
Annexin III from Homo sapiens and Rattus norveqicus r respec- 
t i ve Ly; 

Annexin V from Homo sapiens, Mus musculus, Rattus norvegicus, 
15 and Callus gallus, respectively; 

Uteroglobulin from Oryctolagus cuniculus and Lepus capensis , 
respectively; and 

Phospho 1 ipase A2 inhibitor from Trimeresurus flavovi rldis , 
as well as inhibitors homologous therewith isolated from other 
20 sources than those explicitly mentioned. 

Rlfbonucl ease InhlJbl tars 

Barstar from Bacillus amyloliquefaciens; 

Ribonuclease inhibitor from Homo sapiens, Sus scrofa, and 
Rattus norvegicus r respectively, as well as inhibitors homolo- 
25 gous therewith isolated from other sources than those 
explicitly mentioned. 

RNA polymerase Inhibitors 

Bacterial RNA polymerase inhibitors from Bacteriophages T3 and 
T7 as well as inhibitors homologous therewith isolated from 
30 other sources. 
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DNA-entry nuclease inhibitors 

Competence protein J from Bacillus suhtilis as well as inhibi- 
tors homologous therewith isolated from other sources. 

Beta- lactamase inhibitors 

5 Be ta-lactamase inhibitor from 5trepto/nyces clavuligerus as 
well as inhibitors homologous therewith isolated from other 
sources . 

To this list of enzyme inhibitors can be added functionally 
related inhibitors such as the inhibitor of calcium transport 
10 seminaiplasmin from Bos taurus and Mus musculus , respectively, 
as well as inhibitors homologous therewith isolated from other 
sources . 

To the above list of suitable scaffold molecules to be used in 
the invention should be added other agents having an effect on 
15 the activity of enzymes. One interesting candidate is 
thioredoxin . 

All of the above-listed scaffold molecules can be substituted 
with an effective part of the complete molecule. 

Properties of the scaffold molecule 

20 Since the peptide library presumably can reach every compart- 
ment of a cell, it is beneficial if the scaffold enzyme 
inhibitor protein is not too large, and that it is stable 
towards e.g. proteolytic attack and insensitive to the redu- 
cing environment inside eukaryotic cells. Hence, its function 

25 should preferably not be dependent on the formation of disul- 
fide bridges, since these are not formed in the cytosol or 
nucleus of such cells. In addition the scaffold protein should 
contain one or more exposed loops in which peptides can be 
inserted without markedly changing the structure or stability 
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of the protein. On this basis the chymotrypsin inhibitor 2A 
(cf . the discussion below) is a suitable scaffoLd but other 
seat folds such as the examples mentioned below could also be 
us ed . 

5 The members of the peptide Library introduced into target 

cells can effect change of the target cells' phenotype either 



the method of the invention renders possible the identifica- 
tion of not only the peptide or ribonucleic acid sequence 
10 which is responsible for the direct or indirect effect on 

phenotype, but also allows for the purification, isolation and 
identification of the target molecule with which the peptide 
in t.e ruct s . 

in the present specification the barley-derived chymot rypsin 
15 inhibitor 2A has been used by way of example, mainly because 
this protein is very well-characterized and extremely stable. 
It will be understood, however, that the use of this ensyme 
inhibitor in the Examples in no way should limit the scope of 
the present invention. 

20 Barley chymotrypsin inhibitor 2A belongs to a large family of 
homologous protease inhibitors mainly found in plants. This 
family includes barley chymotrypsin inhibitors 1A, IB, 2A and 
2B, potato inhibitor I , wound-induced tomato inhibitor I, 
ethylene-responsive tomato inhibitor, wild tomato fruit 

25 inhibitor I, a subtilisin inhibitor from broad bean, adzuki 
bean subtilisin inhibitor, pumpkin t rypsin/Hageman factor 
inhibitor, bitter gourd inhibitor, protoplast-specific trypsin 
Inhibitor from Nicotiana sylvestris, tobacco subtilisin inhi- 
bitor, amaranth trypsin/subtilisin inhibitor, and beach 

30 canavalia subtilisin inhibitor. The only member of this 

inhibitor family that is of non-plant origin is the leech 
elas tase/ca thepsin G inhibitor eglin C. 



i uiaijou'.MiiiMdiiL ) i n l orac c. l on . however, 
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CI-2A was oriqinally purified from the endosperm of the 
high-lysine barley Hiproly and shown to be a tight binding 
inhibitor of the microbial subtilisins Carlsberg and Novo as 
well as of chymo t rypsin . The N-terminal amino acid residue in 
5 CI-2A has a b Locked amino group as direct amino acid sequen- 
cing was unsii'Xessful. However, most of the amino acid se- 
quence of CI-2A has been determined at the protein level. In 
addition, it was shown that CI-2A purified from barley is 
N-terminally processed either during synthesis and storage in 

10 the endosperm or in the process of purification. The absence- 
of the 17 N-terminal amino acid residues does not influence 
the complex formation of CI-2A with subtilisins. Combining the 
results of amino acid sequencing and cDNA sequencing it has 
been deduced that CI-2A consists of 83 amino acid residues in 

15 a single polypeptide chain containing no disulfide bonds. The 
blocked N-terminal amino acid residue is serine. The reactive 
site in CI-2A has been determined to be the Met59-Glu60 pep- 
tide bond and the residues in the region Ile56-Arg62 have been 
demonstrated to be involved in the intermolecular contacts 

20 between inhibitor and protease. 

The three-dimensional structures of uncomplexed CI-2A as well 
as of CI-2A complexed with subtilisin Novo are known from 
X-ray crystallography. The three-dimensional structure of 
CI-2A has also been determined using NMR spectroscopy, reveal- 

25 ing that the reactive loop of CI-2A is dynamic. CI-2A consists 
of a single a-helix docking against four p-strands. The sur- 
face loop stretches across the free side of the sheet and is 
composed of eight residues: Gly54-Tyr61. In contrast to most 
enzyme inhibitors, CI-2A lacks disulfide bonds as well as 

30 glycosylation sites. In the structures determined, only the 64 
C-terminal amino acid residues are defined (L20-G83); this 
truncated version retains the functionality of the native 
protein. Comparing the complexed form with the two uncomplexed 
forms of CI-2A reveals few differences. The most notable 
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difference rs that the reactive site loop seems to have a less 
ordered s t ructure in the uncomp lexed forms than in the 

compiexed t o cm. The thre d linen sionai structure of CI - 2 A in 

complex with subt i lisin Move has also been compared to the 
5 t h r c e - d i me n s i o n a 1 s t r uct u r e o f e g 1 in C in c omp 1 e x w i t h 
subt i .1 is in Car lsberg . The two homologous inhibi tors have 
highly similar secondary and tertiary structures . 

Recombinant variants of N-te rmi nal ly t runcated CI -2A 

(CI -2A (L20-G83) ) and CI-2A with an N-terminal Asp-Pro exten- 

10 sion have been widely used to study the folding and stability 
of C1-2A. The structure of CI-2A in complex with subtilisin 
Novo has revealed that the number of intermolecular contacts 
between inhibitor and pre- tease in the P4-P1 region 
( I leSG-MetS9) of the inhibitor are much larger than in the 

15 PI 1 - P 3 f region (Glu60-Arg62 ) . 

Further aspects of the invention 

Having identified a modulator according to the above-detailed 
methods of the invention it is normally of interest to provide 
large quantities thereof for the purposes of further research 
20 and development, including possible identification of the 

target molecule with which the modulator physically interacts. 

Therefore, the invention also pertains to a method for the 
preparation of a replicable expression vector, the method com- 
prising the steps of identifying a modulator by the methods of 
25 the invention, and subsequently 

isolating or synthesizing a nucleic acid sequence which 

encodes the modulator, and 

engineering a replicable expression vector comprising an 
operon which comprises, in the 5 f -3' direction and in 
30 operable linkage, 1) a promoter for driving expression of 

the nucleic acid sequence, 2) optionally a nucleotide 
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sequence encoding a leader peptide, 3) the nucleic acid 
sequence, and 4) a termination signal. 

Such methods are widely known in the field of genetic 
engineering and molecular biology. The skilled person will 
5 find suitable guidance in e.g. Sambrook J, Fritsch EF, Mani- 
atis T. 1989. "Molecular cloning: A laboratory manual", 2nd 
ed. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 

In this part of the invention it is preferred that the pro- J " 
moter is inducible; constitutive promoters are not excluded 

10 though. Depending on the choice of host cell to carry and 
express the vector, the promoter is selected from the group 
consisting of a bacterial promoter, a fungal promoter such as 
a yeast promoter, and a mammalian promoter. The vector can be 
in the form of a plasmid, a phage, a cosmid, a minichromosome , 

15 or a virus, again depending on choice of host cell and other 
considerations applying for the specific case. However, it is 
most preferred that the expression vector is capable of being 
integrated into the genome of a suitable host cell, since the 
expression therein will then be more stable over time than is 

20 the case with non-chromosomal transformation of the host cell. 

Well-known vector systems are based on bacterial plasmid 
pBR322, X-phage, and yeast plasmid YRp7 , but other suitable 
and feasible choices are known to the skilled person. 

After having provided a suitable expression vector as outlined 
25 above, it is preferred that the modulator is produced by 

transforming a suitable host cell with an expression vector 
prepared as described above. Such a host cell can be bacterial 
(e.g. E. coli, Bacillus suhtilis, or any other suitable bacte- 
rial host cell), a fungus (e.g. a yeast cell such as 
30 Saccharomyces cerevisiae or Piccia pastoris) or a plant, 

insect or mammalian cell (which can be any of the above-dis- 
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cussed ceLLs or cell-types suitable for use as the "substan- 
tially identical cells" in the method of the: invention). 

After havinq provided a producer cell line as doser ibod above 
it is nc'W possible to preiur;o the modu 1 a t< r of the Invention 
5 by 'jrowirKi the transformed col 1 prepared as described above in 
a culture medium under conditions which facilitates expression 



subsequently harvesting the express i_on product i rem the cell 
and/or the culture medium. Alternatively, the modulator can be 

10 produced by synthesizing the modulator by moans of chemical 

synthesis on the basis of the sequence determined in step (e). 
In the case that the modulator is a peptide, the well-known 
techniques of solid- or liquid-phase peptide synthesis can be 
employed and also if the modulator is a ribonucleic acid, me- 

15 thods for synthetic production thereof are readily available. 

An important part of the invention pertains to the isola- 
tion/identification of the target biomolecules which are enga- 
ging the modulator, the identification, isolation and produc- 
tion of which is described above. Hence an important part of 

20 the invention pertains to a method for isolating and/or iden- 
tifying a target biomolecule, the method comprising providing 
a modulator according to the methocis described herein and 
subsequently using the modulator as an affinity ligand in an 
affinity purification step so as to isolate the target 

25 biomolecule from a suitable sample. The affinity purification 
step can e.g. be an affinity chromatographic step, an affinity 
mass spectrometry step, or a co- immunoprecipi tat ion step. 
However, any suitable method for affinity-based isola- 
tion/purification can be employed. 

30 Alternatively, the modulator can be used as a probe against a 
cDNA library derived from the substantially identical cells or 
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as bait in a two- or three-hybrid system (bacterial, fungal or 
mammalian ) . 

It is preferred that the target biomolecule is a peptide or a 
nucleic acid, since this also allows for sequence determina- 
5 tion thereof. 

The potential use of such a target biomolecule is to employ it 
in a drug-development program. For this purpose it is often 
useful to resolve or get information about the 3-dimensionaL 
structure of the target biomolecule (by means of methods 
10 available, e.g. X-ray diffraction studies, NMR analysis, 
circular dichroism, etc) . 

Having isolated a target biomolecule as described above, the 
invention allows for the rational selection of a chemical com- 
pound to be used as a putative drug candidate in drug develop- 
15 ment, the method comprising the steps of 

- assaying a library of chemical compounds for interaction 
with a target biomolecule which has been de novo isolated 
according to the methods of the invention, and 

- selecting compounds which interact significantly with the 
20 target biomolecule. 

Such an identified drug candidate can be a lead compound or a 
drug candidate as such. A "lead compound" is in the present 
application understood as being a compound which is not in 
itself suitable as a drug but which exhibits a number of 

25 characteristics which are interesting when viewed from the 
point of view of medical therapy. The reason such a lead 
compound is unsuitable could be toxicity, unsuitable 
pharmacokinetic or pharmacodynamic properties, difficulties 
relating to preparation and purification etc. In such cases, 

30 the lead compound is used as a model for de novo synthesis of 
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other chemical compounds which are designed so as to be re- 
lated to the active part of the lead compound in 3D structure 
and distribution o ; charged, polar and non-polar group-. 

This approach can be refined by i nitia 1 ly identif yinq the mem- 
5 bers of the library by methods of structure-based or non- 
structure based computer drug-modelling. Suitable non-struc- 
^ a r ^ ^ ^. ^ v« in., l . i ■. o .:i i ^ u .1 ^ j i_> ^ ^ <a in u o 5 , 5 G 7 , z o / a n a u s 
5, 025, 383; a method known as CoMFA) . An alternative is HASL 
(Hypothetical. Active Site Lattice; Hypothesis Software). Both 
10 these methods are based on 3D-QSAR. A feasible structure-based 
approach is e.g. disclosed in T WO 95/06293. 

Finally, a very important part of the present invention per- 
tains to a method for the preparation of a medicinal product, 
the method comprising the steps of 
15 a) selecting a chemical compound by the methods of the in- 
vention described above, 
b) performing pre-clinical tests with the chemical compound 
in order to assess the suitability thereof as a medicinal 
product, 

20 c) entering, if the chemical compound is deemed suitable in 
step (b) , clinical trials using the chemical compound in 
order to obtain market authorization for a medicinal 
product including the chemical compound as a pharmaceuti- 
cal ly active substance, and 

25 d) upon grant of a market authorization, admixing the chemi- 
cal compound with a pharmaceutical ly acceptable carrier, 
excipient or diluent and marketing the thus obtained 
medicinal product . 

In other words, also encompassed by the present invention is a 
30 method for developing a medicinal product, the method compri- 
sing that a modulator identified according to the method of 
the invention serves as a lead compound in the development 
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phase or wherein a target biomolecule isolated/ identified 
according the invention serves as an interaction probe against 
putative drug candidates in the drug discovery phase. 

The above-outlined methods should of course take into conside- 
ration all necessary requirements in order to meet GCP and GMP 
standards. At any rate, this method is completely depending on 
the initial provision of the modulator identified according to 
the invention. 

LEGEND TO THE FIGURES 

Fig. 1: A schematic representation of pCMVbipep/C I -2A with the 
functional cis-elements found in pCMVbipep indicated. 
At the top, a schematic representation of pCMVbipep is pre- 
sented and the CI-2A cDNA region is expanded in the illustra- 
tions below showing the position of the various signal pep- 
tides (SEQ ID NOs: 35-37) present in pCMVbipepER/CI -2A, 
pCMVbipepNLS/CI-2A, and pCIMVbipepSL/CI-2A, respectively. The 
extra amino acids added to CI-2A are written in one letter 
code with the essential amino acid sequences required for 
function underlined. Abbreviations: nuclear localization 
signal, NLS; endoplasmic reticulum, ER; secretoric leader, SL; 
retention signal, RS . 

Fig. 2: Total extracts from CMVbipep/CI-2A transduced cells 
are capable of inhibiting the protease activity of subtilisin. 
Subtilisin was incubated with increasing amounts of extracts 
from the indicated cell lines either transduced with CMVbipep 
(NIH-3T3, U20SmCAT, and 293mCAT) or CMVbipep/CI-2A (NIH- 
3T3/CI-2A, U20SmCat/CI-2A, and 2 93mCat /CI-2A) and subsequently 
assayed for residual proteolytic activity. Each reaction with 
a given extract concentration was determined in triplicate and 
the shown velocities are based on the mean values. 
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Fig. 3: Nuclear extract from CMVbipepNLS/CI-2A, but not from 
CMVbipep/CI-2A transduced NIH-3T3 cells exerts CI-2A activity. 

A) Subtilisin was incubate.] with either 2 or 10 pi of nuclear 
extract from NIH-3T3 cells transduced with either CMVbipep 

5 (-), CMVbipep/CT-2A (CI-2A), or CMVbipepNLS /CI -2A (NLS/CI-2A) 
, i n d s ubsequontly a s s a y e d for proteolytic a c t i v i t y . 

B) As for A except that 0.-! or 2 pi total cell extract was 

^ ._:» < .i . i<uCii l cu^l. wiLii ci Lj _L v LJ i 1 t: A I, -L ci C L C UliUeil t Ld L 1 O U W a S 

determined in triplicate and the shown velocities are based on 
10 the mean values. 

Fig. 4: Schematic presentation of p FAB 60 constructs. 
Upper panel: pFAB60/CI»20 contains the truncated version of 
CI-2A is inserted in frame with a pelB leader sequence at the 
5 T end and the deleted gene encoding a fragment of the Ml 3 
15 phage surface protein III (Aglll) at the 3' end. The pelB 

leader directs the expressed fusion protein to the bacterial 
membrane thereby facilitating incorporation of the CI-2A/ApIII 
fusion protein into phage particles. 

Middle panel: pFAB60 /muCI -2A contains the CI-2A cDNA including 
20 the recognition sites for Muni I and Sail restriction enzymes. 
Lower panel: pFAB60/muCI-2A_rc has amino acid 58-61 substi- 
tuted for the underlined 19 randomly composed amino acids (SEQ 
ID NO: 38, residues 2-20) . The amino acids are shown in one 
letter codes and the numbers refers to the unmodified CI-2A 
25 amino acid sequence (cf. The numbering in SEQ ID NO: 2). 

Fig. 5: CI-2A and CI-2A_rc can be displayed on the surface of 
phage particles. 

Anti CI-2A antibodies were immobilized and incubated with 0 or 
5 x 10 11 phage particles, produced and purified as in Johansen 
30 et al., 1995, Protein Eng. 10, pp. 1063-1067 and quantified by 
OD269 measurement. CI-2A negative (-) CI-2A or CI-2A_rc carry- 
ing phage particles were used. The retained phage particles 
were finally detected by a horse radish peroxidase conjugated 
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anti phage antibody. The results shown are mean values from A 
independent measurements with indicated standard deviation. 

Fig. 6: Construction of a CI-2A presented library using the 
Muni and Sail cloning sites. 
5 A) A degenerated oligo (SEQ ID NO: 33, residues 21-46) cover- 
ing the recognition sites for Muni and Sail is converted to a 
double stranded form in a single extension reaction. 
B) Schematic presentation of the modified cDNA of a CI-2A 
peptide library. 
10 C) Three-dimensional structure of the 64 C-terminal amino 

acids. The substituted amino acids are shown in white with the 
sequence of the randomized CI-2A gene above. 

Fig. 7: Construction of a CI-2A presented library using BamHI 
and Sail cloning site. 

15 A) The 5' part of the CI-2A cDNA can be made by using overlap- 
ping oligos that after annealing are extended by the use of a 
DNA polymerase. The resultant fragment can then be cleaved 
with BamHI and Sail before ligation into pCMVbipep/muCI-2A. 
B) Another strategy is to use the randomized oligo in a PCR 

20 together with an upstream forward primer. Again a BamHI/ Sail 
fragment can be purified an ligated into pCMVbipep/muCI-2A. 



PREAMBLE TO EXAMPLES 

In the following, the present invention is illustrated by way 
of example wherein CI-2A is used as starting point for the 

25 CellScreen™ technique adapted according to the invention so as 
to allow intracellular expression of a scaffold protein inhi- 
bitor which is randomly modified in the active site. This 
example is non-limiting, in the sense that other suitable 
protein inhibitors of enzymes could be used instead of CI-2A. 

30 The skilled person can readily perform the necessary substitu- 
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tions and modifications necessary in order to apply the prin- 
ciples exemplified below using other protein inhibitors of 
enzymes as starting point. 



EXAMPLE 1 



1-a: Construction of pCMVbi. pep/Cl-2A 

The 5788 bp hybrid plasmid pCMVbipep (the sequence of which is 
set forth in SEQ ID NO: 39 and which is shown schematically in 
Fig. 1 with CI-2A inserted downstream the packaging signal) 

10 consists of an AKV derived retroviral insert which has been 
cloned into the pUC-19 cloning vector. The retroviral insert 
contains a chimeric 5' LTR, allowing expression from the 
strong cytomegalovirus (CMV) promoter when transcription is 
driven from the plasmid. Following integration of the vector 

15 into a host genome, transcription is driven from the retrovi- 
ral LTR. A versatile poly-linker is present downstream of the 
packaging signal. This enables the insertion of peptide li- 
braries in this position. Immediately downstream of the 
polylinker is an internal ribosomal entry site (IRES), derived 

20 from the encephalomyocarditis (EMC) virus (Koo et al . 1992, 
figure 2A-G) . This allows efficient translation from the 
downstream expression cassette in a CAP independent (IRES) . A 
Neo marker gene is found in the downstream expression cas- 
sette . 

25 pCMVbipep includes a chimeric CMV and Akv promotor/enhancer. 
It was constructed from the vector plasmid constructs PUT 64 9 
(CAYLA, toulouse FRANCE) and AkvBiPep (Duch et al., unpub- 
lished, described below) . AkvBiPep was digested with EcoRI and 
AscI and the 2779 b P fragment from position 3293 through 5670 

30 to 401 was isolated and ligated to a 543 bp fragment contain- 
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ing the enhancer /promotor of CMV position 44 to 548. This 
fragment was obtained by PCR amplifying an EcoRI and AscI 
digest of PUT 649 using the following two primers 
GGTCAGGAATTCTCCGGAATTGGCTAGCCTAGAGTCCGTTACATAACT (SEQ I D NO : 
5 40) and 

GAGGACTGGCGCGCCGAGTGTGGGGTTCTTACCCTTTTTATAGACCTCCCACCGTACACGCC 
T (SEQ ID NO: 41). The resulting fragment was digested with 
AscI and ligated to an AscI fragment of AkvBiPep (fragment 
from position 828 to 3293). 

10 AkvBiPep was constructed by digesting pBiZeo-Neo (Duch et al., 
Biotechniques, 1999, 26(6): 1032-4, 1036) with BamHI and Stul. 
The fragment from 1708 through 6054 to 1240 was ligated to a 
DNA fragment made by extending with two olignucleotides . The 
oligonucleotides were 

15 AGATCTCCGAGGCCTGGGACCCTTCGAATTCGTTAACTGATCAACGCGTTCTAGACTACATG 
GCGGCCGCGTGTTT (SEQ ID NO: 42) and 

GGGGGATCCAGAGCTCGAGCTTTGAAAAACACGCGGCCGCCATG (SEQ ID NO: 4 3). 
After extension and prior to ligation, the DNA fragment was 
digested with BamHI and Stul. This operation removes the Zeo 
20 gene in pBiZeo-neo and replaces them with a polylinker. 

To construct pCMVbipep/CI-2A, 2 yg pCMVbipep was digested with 
BamHI / Xhol as described in A-5. The 5778 base pair fragment 
was subseguently purified from a 0.8% agarose gel according to 
A-7. A CI-2A cDNA fragment encoding amino acids 21-83 (cf. the 

25 amino acid numbering in SEQ ID NO: 2) of wild type CI-2A 

protein fused to an in frame methionine start codon was ampli- 
fied by PCR from a pUC 19 derived plasmid containing a frag- 
ment encoding the wild type CI-2A protein (cf . the nucleotide 
numbering in SEQ ID NO: 1) (Generous gift from Dr. Ib G. 

30 Clausen, Novo Nordic A/S, Denmark) - the cDNA fragment in- 
cludes the nucleotide sequence of SEQ ID NO: 1, where nucleo- 
tides 249, 252, and 279 are T, C, and T, respectively. The PCR 
conditions in the 4 performed reactions were as described in 
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A-3 ur.inq the following primers CGGGATCCATGAAGACAGTGGCCAGAG 
(SEQ ID NO: 3) and CGCTCGAGTCAGCCGACCCTGGGGACCT (SEQ ID MO: 4) 
which t lank the specified ee i < -n of CI-2A with ruce-mt n >r. 
-ites : <: > : Ba/»H I and Xhol . The reaction mixture- was applied to 
: > 1 l > rhiv-H-^t; 0 p (.cycles of M V ; or 1 minute, 60°C • a : :o s-- 
eonds, 72°G for 3D secoml; i. . d t owed by pu r i f i ca t i oi : of the 
amplified DNA f raunient as described in A-3 with a 100 pit 



(see sect ion A~5 ) and purified {see section A- 7 ) using 40 
10 units, of BamRI and Xtiol with l.:0 yl as final react: ion volume. 
This fragment was liaated into the- Bamlil/Xhol cleaved 
pCMVbl pep (see section A- 1 ) .and confirmed by DNA sequencing 
(section A-3) using the pGMVbipep specific primer 
CTGTATCTGGCGGCTCCGTGG (SEQ ID NO: 5) . 

1 5 1 -b : ( .V > n s t r u c t i o n o f omCAT IR ES h y g 

To enable transduction of non-murine cells with retroviral 
vectors harvested from the Bosc packaging cell line a vector 
(prnCATIREShyg ) encoding the ecotropic receptor was construc- 
ted. The mCAT c DNA was obtained from pJET (Albritton et al. 

20 (1939) Gell 57, pp 659-666) and inserted into the pIRESHhyg 
(CloneTech) to give rise to pmCATIREShyg . This was done by 
digesting pJET in 1 x EcoRI restriction enzyme buffer together 
with 2 units/ul EcoRI (see section A-5) . The 5' overhangs of 
the EcoRI digestion were filled out using K.lenow polymerase 

25 according to the manufacturer's protocol (New England Bio- 
labs). After incubation for 1 hour the sample was purified by 
phenol /CHC1< extraction (see section A-4) followed by digestion 
with 20 units BamRI in a reaction volume of 20 pi (see section 
A-6) and purified on a 1.0% agarose gel (see section A-7). 

30 This fragment was ligated into the 5699 base pair BamHI/ BstXI 
fragment of pIREShyg prepared as the mCAT fragment except that 
the blunt ended termini were derived from the BstXI site. 
After ligation and transformation, positive clones events were 
isolated and confirmed by DNA sequencing (section A-9) using 



Tjit.' puriLieu iaJA t ragment was 



thoi d i qos t ed 
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the following primers ACAGCTGGCCCTCGCAGAC (SEQ ID NO: 6), 
CCCACTGCTTACTGGCTTAT (SEQ ID NO: 7), TGGGGCTGCACGTCATTTG (SEQ 
ID NO: 8), TGTGCTACGGCGAGTTTGGT (SEQ ID NO: 9), 
GGTTCGTGAAAGGCTCCATT (SEQ ID NO: 10), and 
5 GAAATGTTCACAATTAGCCCTG (SEQ ID NO: 11). 



1-c: Construction of pCMVbipeoNLS/CI-2A 

An oligonucleotide encoding the SV-40 large T antigen nuclear 
localization signal was added in frame to the N-terminus of 
the truncated CI-2A version situated in pCMVbipep/CI-2A (Fig.- 
10 1). The construction was performed by PCR (see section A-2) 
using 25 pmol of 

GAAGATCTATGGCGGCCGCACCAAAAAAGAAGAGAAAGGTAGGATCCATGAAGACAGAGT 
(SEQ ID NO: 12) and CGCTCGAGTCAGCCGACCCTGGGGACCT (SEQ ID NO: 
13) as primers and pCMVbipep/CI -2A as template. This reaction 

15 was performed in duplicate and applied to 25 cycles consisting 
of 94°C for 1 min, 45°C for 1 minute, 72°C 1 minute. The 
solution was ethanol precipitated before purification of the 
250 base pair DNA fragment using a 2% agarose gel (see section 
A-7). The termini of the fragment were trimmed by addition of 

20 20 units of BgJII and Xhol and 6 pi NEBuffer 3 to a 60 pi 
reaction volume (see section A-6) including 1 mg/ml bovine 
serum albumine (New England Biolabs) . The cleavage products 
were separated on a 2% agarose gel and the 240 base pair 
fragment was subsequently purified (see section A-7) with a 50 

25 pi elution volume. This fragment was finally ligated into 
BamHI/XhoI cleaved pCMVbipep (see construction of 
pCMVbipep/CI-2A) and confirmed by DNA sequencing (see A-8). 

1-d: Construction of pCMVbioepSL/CI-2A and pCMVbipepER/CI -2A 
The secretoric leader (SL) was amplified by PCR using 
30 pBapePuro containing the human immunoglobulin heavy chain 
signal peptide (Beerli et al. (1994) J. Biol. Chem. 269, 
pp23931-23936) as template and 25 pmol of 
GAAGATCTATGGACTGGATCTGGCGCATCC (SEQ I D NO : 14) and 
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GAGGATCCAGAATGAGCGCCGGTAGCAG ( SEQ ID NO: 15) as primers. The 
reaction conditions were similar to those described in con- 
st cue t i on of pCMVb i pepNLS /CI -2A . The amplified 74 base pair 
product was after phenol /CHOI < extraction (see section A-4) 
5 digested with BamU ! and Bgl [ I 13 described in section A~5 

except thai; the reaction included 1 ul/rnl bovine serum a 1 bum in 
(Mew England Biolabs). After digestion, the 63 base pair 
1 l .a-jin^iii. wuo puL.iLj.eo using a 6 low melting agarose gel (see 
section A-6) ending up with a ?0 u.L volume. A Ba^Hl digested 

10 pCMVbipe P /ci-2A vector was prepared essentially as described 
for construction of pCMVb ipep/C I -2A except that 10 units of 
calf intestinal phosphatase (Booh ringer Mannheim) were present 
during the Last hour of the incubation period. The purified 
secretory leader containing fragment was then ligated into the 

15 BamHI cleaved pCMVbipep/CI -2A (as described in section A-l) to 
give rise to pCMVbipepSL/CI-2A. Positive clones were confirmed 
by DNA seguencing (section A-8) using CTGTATCTGGCGGCTCCGTGG 
(SEQ ID NO: 16) as primer. 

Construction of pCMVbipepER/CI-2A (Fig. 1) was performed by 
20 addition of a retention signal to the C-terminus of CI-2A in 
the context of pCMVbipepSL/CI-2A. The retention signal was 
added in frame with the CI-2A sequence by PCR using 
CTAATCTAGACTACAGCTCGTCCTTGTAGTCCTCGAGGCCGACCCTGGGGACCTG ( SEQ 
ID NO: 17) and CGGGATCCATGAAGACAGAGTGGCCAGAG (SEQ ID NO: 18) 
25 and pCMVbipep/CI-2A as template. The 237 base pair fragment 
was digested with Xbal instead of Xhol and ligated into a 
BamHI/Xbal digested pCMVbipepSL/CI-2A prepared as BamHI/XhoI 
digested pCMVbipep. All reaction parameters were as described 
for construction of pCMVbipepNLS/CI-2A. 

30 1-e: Construction of pCMVbipep/muCI-2A 

Two silent mutations were introduced into the CI-2A sequence 
by a 5 step PCR procedure. The first PCR exchanged C 192 -+ A 
(cf. The numbering in SEQ ID NO: 1) and the product was after 
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purification used as reverse primer for amplification of the 
CI-2A encoding region in pCMVbipep/C I -2A . This C 19:> — ♦ A mutated 
fragment was used as template for introducing the second 
mutation substitution of T 300 -+ C (cf . The numbering in SEQ ID 
5 NO: 1) and thereby leading to a PCR-product including the 
previously introduced mutation. The double mutated fragment 
was used as forward primer to amplify the pCMVbipep/CI-2A 
defined reading frame. All the PCRs were done in duplicate and 
in essence performed as described in section A-2 with 25 

10 cycles of 94°C for 30 seconds, 55°C for 30 seconds, and 72°C - 
for 30 seconds. The products were pooled before purification. 
In the first PCR, 50 ng of pCMVbipep/CI-2A was used as tem- 
plate with 50 pmol of the following primers 
CCGGCCTTATTCCAAGCGGC (SEQ ID NO: 19) and 

15 CTGCCGGTGGGTACAATTGTGACCATGG (SEQ ID NO: 20) . The 248 base 

pair product was purified using the QIAquick PCR purification 
kit (see section A-3) with 50 pi as elutiori volume (PCR-pro- 
duct 1). The second PCR consisted of 12.5 pi PCR-product 1 and 
50 pmol CTGTATCTGGCGGCTCCGTGG (SEQ ID NO: 21) as primers with 

20 50 ng pCMVbipep/CI -2A as template. The complete reaction was 
loaded on a 2% agarose gel and 448 base pair product was 
purified as described in section A- 6 (PCR-product 2). This 
product was then used as template in the third PCR that be- 
sides the 12.5 pi PCR-product 2 consisted of 50 pmol 

25 CTGTATCTGGCGGCTCCGTGG (SEQ ID NO: 22) and 

CGAGTTTGTCGACAAAGAGGCGGACGCGATCGATGCGATATTCC (SEQ ID NO: 23) 
leading to amplification of a 248 base pair fragment (PCR- 
product-3) . This fragment was purified as PCR-product 2 before 
using 12.5 pi as forward primer together with 50 pmol of 

30 CCGGCCTTATTCCAAGCGGC (SEQ ID NO: 24) and 50 ng pCMVbipep/CI-2A 
as template. The double mutated 448 DNA fragment was after gel 
purification PCR amplified using 50 pmol of 

CCGGCCTTATTCCAAGCGGC (SEQ ID NO: 25) and CTGTATCTGGCGGCTCCGTGG 
(SEQ ID NO: 26) . The amplified double mutated fragment was 
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inserted into pCMVbipep described for construction of 
nCMVbipep/CI-2A. 

J^Lf^_,Const ruction of DFab60/CI-2A and pFAB60 / muCI-2A 
The f ragment of the CI-2A cDNA that encodes amino acids 2.1-83 
5 <.f the wild type protein was inserted into the phageniid vector 
pFABGO (Johan:.;en ct al . , 1995, Protein Eng. 10, pp 1063-7). 

^- L ^uiv*\ LiciLjiiitjiL was amprniea oy t'CK using 

pCMVbipep/CI-2A as template with 10 pmo.L of the primers 
ATTTGCTAGCTGCACAACCAGC7VATGGCACTGAAGAt!AGAGTGGCCAGAGTTGG ( SEQ I D 

10 NO: 27) and ATAAGAATGCGGCCGCGCCGACCCTGGGGACCTGGGC (SEQ ID NO: 
28) in 4 reactions performed under conditions similar to those 
described under construction of pCMVbipep/CI-2A. The purified 
237 base pair fragment and 2 rag pFAB60 were digested in paral- 
lel using 10 units Nhel and Not I in 50 ml 1 x NEBuffer 4 

15 including 1 }ig/ml bovine serum albumin (see sections A-5 and 
A-6) . The digested 225 and 4655 base pair fragments were 
subsequently purified using 1% agarose gel and ligated to each 
other (see section A- 7) and A-l, respectively. Correct 
insertion was finally confirmed by DNA sequencing using 

20 CACACAGGAAACTATGA (SEQ ID NO: 29) as primer (See section A-8). 
An identical approach was utilized for constructing 
pFAB60/muCI-2A except that pCMVbipep/muCI-2A was used as CI-2A 
cDNA source . 



1-g: Con struct ion of pFab 60 /muC I — 2 A rc 

25 Substitution of amino acid 59 to 62 in the full length CI-2A 
sequence with a 19-mer randomly composed amino acid sequence 
was performed in the context of pCMVbipep/muCI-2A and the 
modified CI-2A was subsequently moved to pFAB60 following the 
same procedure as for constructing pFAB60/CI-2A. The coding 

30 region for the amino acid sequence was obtained from a 
synthetic oligo that was amplified by PCR. Four parallel 
reactions were performed using 12.5 pmol of 

CTGCCGGTGGGTACAATTGTGCTGCGCTACATGGACCGCGCAATAGTGATGAACGTGAACAT 
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TAGCGCACGCAAACTACGGATTGATCGCGTCCGCCTCTTTGTCGACAAACTCG ( SEQ I D 
NO : 30 ) as template with 50 pmol of the primers 
CGAGTTTGTCGACAAAGAGGCGGAC ( SEQ ID NO: 31) and 
TCTGCCGGTGGGTACAATTG (SEQ ID NO: 32) . These reactions were 
applied to 25 cycles of 94°C for 2.5 minutes, 45°C for 2 
minutes, and 72°C for 2 minutes with all other conditions as 
described in section A-2. The product was purified by phe- 
nol/CHClj extraction and dissolved in 30 pi sterile H 2 0. To 
facilitate insertion into muCI-2A, 5 pi of the fragment was 
supplied with 20 units Muni (Boehringer Mannheim) , 5 pi 10 x - 
SureCut buffer M (Boehringer Mannheim) and sterile H 2 0 to 
adjust the volume to 50 pi, and incubated at 37°C for 4 hours. 
After addition of 6 pi 10 x Sail restriction enzyme buffer and 
3 pi H 2 0 together with 20 units of Sail the incubation con- 
tinued for 4 hours. Purification of the 89 base pair fragment 
was done using a 2.5% low temperature agarose gel (see section 
A-6) . Preparation of the Muni/ Sail digested pCMVbipep/muCI-2A 
was in principle similar to the procedure described above. 
Four pg pCMVbipep were digested with 30 units of Muni in a 150 
pi reaction followed by addition of 30 units Sail in 50 pi 4 x 
Sail restriction enzyme buffer and purified using a 1% gel as 
described in section A-7. The ligation and confirmation 
procedures were performed as for the construction of 
pCMVbipep/CI-2A. 

1-h: Library construction 
The degenerated oligo 

TCTGCCGGTGGGTAGAATTCNNNNKNNKNNKNNKNNKNNKNNKNNKCGGATTGATCGCGTCC 
GCCTCTTTGTCGACAAACTCG (SEQ ID NO: 33) was converted into a 
double stranded form by an extension reaction. The oligo was 
mixed with a 3-fold excess of CGAGTTTGTCGACAAAGAGGCGGAC (SEQ 
ID NO: 34) in 1 x SuperTaq buffer (Enzyme Technologies Ltd.) 
including 8 units SuperTaq polymerase (Enzyme Technologies 
Ltd.) and 0.2 mM dNTP. Heating of the mixture to 94°C for 1 
minute followed by 45°C for 2 minutes was done to ensure a 



WO 00/05406 




PC;T/i)K99/00408 



sufficient annea.L.iruj between the oligos before the temperature 
was changed to 55°C for 4 5 minutes to increase polymerase 
activity. After the reaction was completed, the sample was 
phenol /CHC1 3 extracted before; cleavage of 1/3 of the product 
5 with 100 units KceRl and Sail in 1.00 pi 1 x EroRI buffer 

including 1 pg/ml bovine se-rum albumin (See section A-6). The 
complete digested to base pair DNA fragment was purified on a 
' }r>:: ] t e:r:s or ; V ; : , ^ • culliuji o ; ana Mgated 

(see section A- 1 ) into the Muni /SY? 2 I cleaved pCMVbipop /inuC I" -2A 
10 described in Example 1-g. Randomly chosen colonies were: se- 
quenced as describe - i for const ruction of pCMVb ipep/Ci -2 A . 

1-i: Subti l i s in a s.-c : i y 

The ceil extracts were diluted to the indicated amount:;: in 10 
Vi 1 PBS and added to 25 pi 0.1 M Tris/HCl pH 8.6 containing 5 * 

15 10" 8 M Subtilisin Carlsberg (Sigma) . After incubation at 25°C 
for 30 minutes, 2 5 pL 0.1 M Tris/HCl pH 8.6 containing 5 mM N- 
succinyl-Ala-Ala-Pro-Phe-p-Nit roanil ide (Sigma ) was added and 
the substrate conversion was followed by measuring the absor- 
bency at 405 nm every two minutes until the reaction had 

20 reached exhaustion. 

1-i: Cell lines and culturing conditions 

All the cells were cultured in Dulbeco's modified Eagle's 
medium (DMEM) containing 10% fetal calf serum or 10% new born 
calf serum, 1% L-glutamine, and 50 yg/ml penicillin/strep- 
25 tomycin and incubated under standard cell culture conditions. 

The 293mCAT cells and U20SmCAT cells are derived from 293 
(ATCC# CRL-1573) and U20S (ATCCff HTB-96) by stable transfec- 
tion with pmCAT/IRES-hyg . The 293 cells were transfected by 
the Calcium co-precipitation method outlined for transfection 
30 of the Bosc packaging cells and the U20S cells were tranduced 
using the Fugene transfection kit according to the manufac- 
turer's protocol (Boehringer Mannheim). Stable transf ectants 
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were selected by culturing in the presence of 150-200 pg 
active hygromycin B/ml (Sigma) . After transducing 293mCAT or 
U20SmCAT with retroviral vectors the selection was shifted to 
0.6 mg/ml or 0.4 mg/ml geniticin (GibcoBRL ) , respectively. 

5 The murine NIH-3T3 cell line (ATCC# CRL-165) was upon 

transduction with the retroviral vectors cultured in the 
presence of 0.6 mg/ml geniticin. 

1-k: Transduction of N1H-3T3, 293mCAT, and U20SmCAT cells 
For production of retroviral vector particles, pCMVbipep 

10 derived constructs were transfected into the BOSC packaging 

cell line using a CaP0 4 co-precipitation method. BOSC packaging 
cells (also known as BOSC23 cells, cf . WO 94/19478) were 
diluted to 5 x 10 5 cells/cm 2 the day before the transfection 
and washed once in complete DMEM 2 hours before the transfec- 

15 tion. The CaP0 4 co-precipitated mixtures were prepared by 

diluting 10 mg of pCMVbipep or the pCMVbipep derived construct 
with 10 mg salmon sperm DNA in 450 ml ddH 2 0 and adding 50 ml 
2.5 M CaCl 2 . These solutions were slowly added to 500 ml 2 * 
HEPES-buf f ered saline pH 7.05 (280 mM NaCl, 1.5 mM Na 2 HP0 4 , 50 

20 mM HEPES/NaOH pH 7.05) under gentle shaking followed by two to 
five minutes of incubation at 25°C before adding the precipi- 
tate to the prepared BOSC cells. After 24 hours of incubation, 
the cells were washed twice in PBS (137 mM NaCl, 2.7 mM KC1, 
8.3 mM Na 2 HP0 4 1 . 4 mM KH 2 P0 4 ) and further cultivated in 10 ml 

25 DMEM including supplements for another 24 hours. The media 
from the transfected BOSC cells were collected and diluted 
from 10 to 10 5 -fold in complete DMEM including 6 mg/ml Polybre- 
ne (Sigma) . The recipient cells, which had been plated at 10 4 
cells/cm 2 and incubated at standard cell culture conditions for 

30 24 hours in complete DMEM, were exposed to virus-containing 
media for 24 hours at standard conditions. After this incu- 
bation period, the transduced cell were washed twice in PBS 
and incubated in complete DMEM including geneticin. 
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1. - 1 : Pre paration of total cell extr acts 

The CMVbipep/CI -2A or CMVbipep transduced NII1-3T3 cells were 
harvested from two confluent T17 5 cell flask:-, by incubation 
with 5 ml. 0.5 « Tr yps in-KDTA solution ( GibcoBRL) /p La te . The 
5 recovered cells were diluted 1:1 in complete DMEM arid subse- 
quent:ly collector] by cen t r i f uga t ion , washed twice in 10 ml 
complete DM KM and twice in 1 m L PBS. Finally, the cells were 
> > L* : -■ |.n:'!iu»'u in iuu u L t- n ij . inc ceil s were r e n ci e r e d p e r rn e a b 1 e b y 
three cycles of" freezing in liquid nitrogen and thawing by 
10 incubation at M °C and subsequently centrif uqed at 20000 * g 
her 1 5 minutes to remove cell debris. To inactivate endogenous 
protease activity, the extracts were incubated at 6j°C for 15 
n i i r i u r. e s a n d r e -centri fug e d . 

1 - n\: P r eepa r ation of nu c 1 e a r e x t r a c t s 

15 Two approximately 80% confluent T-175 cell flasks of each cell 
line were used. The cells were harvested by addition of 3 ml 
Trypsin/EDTA solution (GibcoBRL) to each flask followed by 
cent ri f uga t ion of the 6 ml cell suspensions. All the following 
reactions were performed at 4°C using chilled solutions. The 

20 harvested cells were washed in 10 ml complete DMEM, 10 ml PBS, 
and 2 times in 1 ml PBS. After the last wash the cells were 
suspended in 1 ml NP40 lysis buffer (10 mM Tris/HCl pH 7 . 4 , 10 
mM NaCl, 3 mM MqCl 2f and 0.5% NP-40), mildly mixed and incu- 
bated on ice for 5 minutes. The cells were then collected by 

25 centrif ugation at 500 x g at 4°C for 5 minutes and suspended 
in 1 ml NP-40 buffer and the nuclei were immediately harvested 
by repeating the cent ri f ugation . Proteins were extracted from 
the nuclei by suspension in 50 pi low salt buffer (20 mM HEPES 
pH 7.9, 25% glycerol, 0.02 M KC1, 1.5 mM MgCl ? , 0.2 mM EDTA) 

30 followed by addition of 50 pi High salt buffer (Low salt 

buffer supplied with 1 M KC1). After 30 minutes incubation on 
ice, the nuclear extract was cleared by centrif ugation at 
20000 x g for 30 minutes. The supernatants were stored at 
-20°C. 
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1-n: SDS-PAGE and Western blots 

Samples were supplied with H volume of SDS-load buffer (NOVEX) 
including 1/10 volume 1 M dithiot rei tol and heated to 95°C for 
2 minutes before loading on a NuPage (NOVEX) 4-12% gradient 
5 SDS-gel in 1 x MES buffer (NOVEX) and run at a constant cur- 
rent of 40 mA. The gels were equilibrated in blotting buffer 
(lOmM CHAPS/NaOH pH 11.0, 10% Methanol, 0.5% SDS) for 5 mi- 
nutes before transfer of the proteins to a 0.2 pm Obititran 
BA-S 83 membrane (Schleicher & Schoell) by semi dry blotting 

10 for 70 minutes. The membrane was allowed to air dry before - 
blocking 1 hour at room temperature in ECL-buffer (50 mM 
Tris/HCl pH 7.4, 150 mM NaCl, 5 mM EDTA, 0.5% Gelatin, 0.1% 
NP-40) . After blocking, the ECL-buffer was displaced with anti 
CI-2A rabbit serum diluted 1:1000 in EC L— buffer and the incu- 

15 bation was continued for 1 hour. The membrane was subsequently 
washed 3 times, each by incubation for 20 minutes in ECL- 
buffer before adding 1:5000 fold diluted Horse Radish 
Peroxidase conjugated goat anti rabbit serum (Dako) in ECL- 
buffer and incubated and washed as described above. The 

20 development of the signal was done using an enhanced 

chemoluminescent kit according to the manufacturer' s protocol 
(Amersham- Pharmacia) . 



l-o: Production of anti C1-2A polyclonal antibodies, 
immunoprecipi tat ion . 

25 Rabbits were immunized with 100 pg full length recombinant CI- 
2A (generous gift from Dr. Ib G. Clausen, Novo Nordic A/S, 
Denmark) in complete Freunds adjuvant (Sigma) and boosted 4 
times with 3 weeks intervals by injecting 50 yg CI-2A formu- 
lated in incomplete Freunds adjuvant. The anti CI-2A response 

30 was detected by ELISA with recombinant CI-2A immobilized on a 
MaxiSorb plate (Nunc) . Several dilutions of the CI-2A serum 
were allowed to bind and subsequently detected with a horse 
radish peroxidase conjugated goat anti rabbit antibody (Dako) . 
All reactions were performed as described in example 1-p. 
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Protein A containing sepharose CL-4B beads (Pharmacia Biotech) 
were pro-washed in dilution buffer (PBS including 0.1?, Triton 
X-100, 0.1% bovine serum albumin) before addition of 4 0 pi 
bead volume to 100 pi of the harvested media together with 10 
r » of the an: i CT-2A antibody. The mixtures were rotating at 

A°C for 1 hour followed by 3 washes in dilution buffer. The 
beads were suspended in 2 x SD3 loading buffer and run on a 

----- - ,~ ^ ... ^n \. jjy wti-.Lt:i.ii bioL (Example 1- 

n) . 

1 0 1-p: ELI SA 

Purified pig anti rabbit polyclonal antibody (Dako) was di- 
luted to 120 ng/ml in 0.1 M NaHCO , pH 8.5 before 50 pi aliquots 
were added to each well in a MaxiSorb plate (Nunc) and incu- 
bated at rc over night. The plate was washed 3 times in ELISA 

15 wash buffer (0.01% pyrophosf ate, 500 mM NaCl, 0.1'; Triton X- 
100) and 50 pi rabbit anti CI-2A serum diluted 1:200 in bin- 
ding buffer (PBS supplied with 0.5% bovine serum albumin, 0.5% 
Tween-20, 10% glycerol) was added to each well. After 1 hour 
of incubation at room temperature, the washing procedure was 

20 repeated and the indicated number of phage particles was added 
in binding buffer. The phage particles were allowed to react 
for 1 hour at room temperature before washing the plate 10 
times in ELISA wash buffer. Bound phage particles were de- 
tected by using a horse radish peroxidase conjugated anti 

25 phage antibody (Pharmacia biotech) by binding and washing 

procedures as described above. The horse radish peroxidase was 
finally assayed using OPD-tablets according to the Manufac- 
turer's protocol (Sigma). 

Standard reactions : 



A 1: Ligation and transformation of Escherichia coli by 
electroporation 
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20 
25 

The 



30 ng vector 

2 pi fragment (containing a fragment concentration re- 
sulting in a 10:1 relation between fragment : vector ) 
2 pi 10 x T4 DNA ligase reaction buffer (New England 
Biolabs ) 

300 units T4 DNA ligase (New England Biolabs) 
Sterile H,0 to adjust the volume to 20 pi 

198 cycles consisting of 30°C for 30 seconds and 10°C for 
30 minutes followed by 16°C over night 

ligation products were transformed into E. Coli by 
t roporat ion : 

10 pi of the ligation reaction was dialyzed against 2 ml 
sterile H : >0 

2 pi of the dialyzed ligation reaction were mixed with 25 
pi competent E. Coli in a 0.1 cm electroporation cuvette 
(Bio-Rad) and pulsed with 2 kV 

Addition of 1 ml of SOC-media (20 g Bac to-Trypt one , 5 g 
Yeast extract, 0.5 g NaCl, 0.19 KC1, dissolved in 950 ml 
H 2 0 and adjusted to pH 7.0 using NaOH and supplied with 50 
ml 20% glucose and MgCl 2 to a final concentration of 10 mM 
before use) 

Incubation at 37°C for 0.5-1 hours before streaking 50 pi 
- 200 pi on LB plates (100 ml LB (10 g Bacto-Tryptone, 5 
g Yeast extract, 5 g NaCl, H 2 0 to 1 1) including 1.5 g 
agar and 50 mg/ml carbenicillin ) 

electroporation competent E . Coli cells were prepared as 



An over night culture was used to inoculate 37°C LB to 
achieve an OD 600 below 0.020 

The culture was incubated at 37 °C until OD 600 was between 
0.8-1.0 and subsequently chilled on ice 



described below: 
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The chilled culture was harvested by centrif ugation 
The harvested bacteria were suspended in 300 ml 10^ ice 
cole q 1 ycerol soiu t i on 
R e p e t i t i o n of t h o c e n t r i f u g a t: i o n 

3 - Th- suspension and cent ri f uqation steps were repeated 
t w i c o 

The bacteria were suspended in 50 ml 10% ice cold gly- 
cerol . . .1 ^ L ; a, 'id kjuuIu^iuu uy ecu l r r i. uga z r on 
The bacteria were suspended in 10o ice cold glycerol 
10 buffer to a final density at 6-12 x 10 1:> cells/ml and 
s torrid at. -8 0 °C in all quo ts 

A-2 : Polymerase chain reactions using TaqGold™ 

100 ng template 

- 10 pmol of specified primers, unless otherwise indicated 

15 - 5 ul 10 x TaqGold™ reaction buffer (Perkin Elmer) 

3 ul MgCl 2 solution for TaqGold™ (Perkin Elmer) 
0.4 ul 25 mM dNTP ( dGTP , dCTP, dATP, and dTTP) 
0.4 ul 5 units/ul TaqGold™ (Perkin Elmer) 
Sterile H 2 0 to 50 ul 
20 - All reactions were initiated by 10 minutes at 95°C and 
after the specified cycles followed by 72°C for 4 minu- 
tes . 

For purification of PCR products see section A- 3 
A-3: PCR using Vent™ DNA polymerase 

25 - 100 ng template 

10 pmol of specified primers, unless other is indicated 
5 pi 10 x Vent™ DNA Polymerase reaction buffer (New Eng- 
land Biolabs ) 
0.4 ul 25 mM dNTP 

30 - 0.4 ul 5 units/ul Vent™ DNA Polymerase (New England Bio- 

labs) 
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Sterile H,0 to 50 yil 

All reactions were initiated by 1 minute at 94°C and 
after the specified cycle program followed by 72°C for 4 



minutes 



5 The amplified DNA fragments were purified by phenol/CHCl 3 

extraction followed by ethanol precipitation if the size was 
less than 100 base pairs. Otherwise, the QIAquick™ PCR-purifi- 
cation kit was used (Qiagen) . The PCR-puri f ication kit was 
used according to the manufacturer' s protocol with 50 \il as > 
10 the elution volume. 

A-4: Phenol/CHCl-, extraction 



15 - 



20 - 



The sample volume adjusted to at least 200 pi with ste- 
rile H 2 0 

Addition of 1 volume Phenol pH 6.7 (supplier) 
Mixed extensively and centrifuged. 

Addition of 1 volume phenol/CHCl 3 1:1 solution to the 
water phase 

Repeat the mix-spin procedure 

Addition of 1 volume CHC1 3 to the water phase 
Repeat mix-spin procedure 

Water phase supplied with 1/10 volume 3M NaAc pH 6.0 and 



2.5 volume -20°C 96% ethanol 



Centrifuged at 20.000 x g for 20 minutes, displacing the 
supernatant with 70% ethanol and repeated the centrifuga- 



25 



tion for 2 minutes 



After the DNA was air dried it was dissolved in sterile 



H 2 0 



A- 5 : Vector preparations 



The plasmid DNA used for vector preparations was purified by 
30 Qiagen maxi prep columns according to the manufacturer' s 
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protocol (Qiagen) . All the used restriction enzymes and their 
reaction buffers were purchased from New England Biolabs 
1 1 n .1 e fs s o t \ i o rwise indicate-'!. 

S } i g - 10 uq vect o r DNA 
5 " Approximately 5 to 10 unit:; of the indicated restriction 
enzymes pr. pq of plasmid ON A 

i/'io ui the rmur volume iu x Bamlll restriction enzyme 
buffer unless other is indicated 
Sterile H 2 0 to the indicated final volume 
10 ~ incubation at 37°C until complete digestion was achieved 

65°C tor 20 minutes 

Purification of the digested fragment as described in A- 7 
A- 6 : Fragment preparation 

As for vector preparation (see A-6) except for the amount of 
15 DNA and that the purification of cleaved DNA fragments smaller 
that 100 base pairs were done using low melting agarose. The 
agarose slice was supplied with 300 ul 0.3 M NaAc, pH 6.0 and 
incubated at 65°C until the agarose was completely melted 
followed by phenol/CHCl 3 extraction and ethanol precipitation 
20 (see A-4) to purify the DNA. 

A-7: gel extraction using Qiagen gel purification kit 

Samples were added 1/6 volume DNA load buffer (60% gly- 
cerol, 0.025% Bromphenol blue, 0.025% Xylene Cyanol) 
Loaded on an agarose gel in 1 x TBE (89 mM Tris-borate, 
25 89 mM Boratic acid, 2 mM EDTA) 

After a satisfactory separation was achieved the DNA was 
extracted using the Qiagen gel extraction kit according 
to the manufacturer's protocol with an elution volume of 
50 ul onless otherwise indicated 
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A-8: DNA sequencing 

All DNA sequencing was performed with the DNA sequencing Kit 
BigDye™ terminator cycle sequencing (Perkin Elmer) using a 25 
cycle program consisting of 96°C for 10 seconds, 50°C for 5 
5 seconds, and 60°C for 4 minutes. The amount of DNA was between 
0.2 and 0.5 pg with 3.2 pmol of the indicated primers. The 
BigDye™ was diluted twice according to the manufacturer' s 
advice. After the cycle program, the DNA was ethanol precipi- 
tated and extensively washed with 70% ethanol before analyses 
10 using an ABI prism 310 sequencing machine (Perkin Elmer). 
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EXAMPLE 2 

Expression of CI-2A in mammalian cells 

A potential scaffold for intracellular presentation of peptide 
libraries expressed from the retroviral vectors utilized in 

15 the CellScreen™ technology should not have any significant 
effects on neither the retroviral replication cycle nor the 
viability of the transduced mammalian cells. Furthermore, the 
scaffold must retain a stable core structure to ensure a 
constrained peptide presentation. In order to test whether 

20 CI-2A fulfills these requirements we constructed pCMVbipep/CI - 
2A (See Example 1-a and Fig. 1). The CI-2A expression con- 
struct was in parallel with pCMVbipep transiently transfected 
into BOSC packaging cells to produce viral particles for 
transduction of NIH-3T3, 293mCAT, and U20SmCAT cells (Example 

25 1-k) . The titers obtained by transduction with the two viral 
vectors were similar, indicating that the CI-2A expression 
unit does not interfere with viral packaging and infection. 

Total cell extracts were prepared from the transduced NIH-3T3, 
293mCAT, and U20SmCAT cells and analysed for the presence of 
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01 --2 A by western blot to obtain a direct proof of the CI -2 A 
expression (Example-:, 1-1 and 1-n). A rabbit anfi C1-2A scrum 
raised acjainst wild typo C1-2A recognized a protein witii 
migrat ion properties that corresponded to that expected for 
5 the 0I-2A t n: ( s e i n expressed f rem OMVbipop/CI -7A . Even an 

ovocexpx su re of the western blot did not revea 1 any bands with 
similar mobility properties in the -extracts derived from 
p"'! 1 .':: s;ee ; r< , , ...>d 1 1-. . •._ d «. . < : i. i. . . tiieieoy snowing cnat s-.dub 1- CI-2A 
is expressed b-y CMVbi pep/C [ -? A and tolerated by the tested 
10 cell lines. 

Interest inuly, a OMVbipep/C I -2A specific band with a slightly 
lower mobility was reproducibly detected by western blots. The 
nature oL this band is uncertain at present. The decreased 
mobility is consistent with a slight secondary modification 

15 such as a phosphorylation, although no experiment ial proof 

exists. Other- more trivial and less likely explanations could 
be a non-consensus start codon use or that the CI-2A stop 
codon is leaky thereby giving rise to an extended CI-2A pro- 
tein. The inventors are currently in the process of characte- 

20 rizing the expression products more detailed by N-terminal 
amino acid sequencing and Mass spectrometry. 

A consistent constrained presentation of a peptide library in 
a scaffold context demands that the structure of the scaffold 
is stable in the given environment. Since CI-2A is naturally 

25 found in the seeds of barley, the shift to the environment 
inside mammalian cells could influence the folding kinetics 
and thereby the stability of CI-2A. The protease inhibitory 
activity of CI-2A provides a simple method for assaying func- 
tionality (Example 1-i), which reflects the amount of native 

30 folded CI-2A. Increasing amounts of the total cell extracts 
used for the western blots were pre- incubated with subtilisin 
before measuring the residual protease active by addition of a 
chromoqenic substrate (Fig. 2). The presence of 0.1-2 yil 
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extract from CMVbipep/ C L - 2A transduced cells completely block- 
ed subtilisin activity, in contrast, when increasing the 
amount of the extracts from CMVbipep transduced cells to as 
much as 10 pi no effects on the subtilisin activity was ob- 
5 served. This result proves that the decrease in subtilisin 
activity observed using extracts from CMVbipep/CI -2A trans- 
duced cells is due to expression of C1-2A. Furthermore, the 
observation that CI-2A extracted from mammalian cells is 
functional suggests a native structure inside the cells, which 

10 support that at random peptide library will be presented by- 
C1-2A in a constrained manner. By comparing the amount of CI- 
2A containing extract required to inhibit subtilisin to a 
standard curve based on purified CI-2A, a rough estimation of 
the active CI-2A concentration can be obtained. This subse- 

15 quently allows a calculation of the cellular concentration. By 
doing so, we estimated the CI-2A content in the tested cell 
lines to be in the uM range. 

The combined evidence from the experiments described above 
suggests that CI-2A is tolerated as an intracellularly located 
20 protein in mammalian cells at a sufficient concentration to 

exert a biological activity. Furthermore, the pronounced CI-2A 
activity found in cell extracts indicates a native conforma- 
tion enabling a constrained peptide presentation from the loop 
region . 



25 EXAMPLE 3 

Expression of CI-2A fusion proteins with a defined subcellular 
localiza tion 

A number of biological reactions are restricted to defined 
cellular compartments. To increase the probability of select- 
30 ing peptides that interfere with such types of reactions we 
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fused amino acid sequences to CI-2A that in other contexts 
have been shown to direct the fusion protein to a defined 
subcellular localization. Signals mediating localization to 
either the endoplasmic reticulum or the nucleus were added to 
test whether or not a restricted localization of CI-2A could 
be achieved . 



_ — .i_ wi4 ^j-ynu.L .Lj_wju Liit: ov-'iu idiqe r-anriqen 
was fused to the N-terminal of CI-2A thereby givinq rise to 
the pCMVbipepNLS/CI-2A construct (Fig. 1 and Example l~c). 

10 NIH-3T3 cells were transduced with CMVbipepNLS/CI-2A, 

CMVbipep/CI-2A, and CMVbipep derived retroviral particles. 
Western blotting and the subtilisin activity assay (see exam- 
ple 1-i and 1-n) were subsequently used to anaLyze the CI--2A 
content in total extracts and nuclear extracts prepared from 

15 all the transduced cell lines. Neither the total extracts nor 
the nuclear extracts derived from the CMVbipep transduced 
cells were able to interfere with the protease activity of 
subtilisin, consistent with the results described in example 
2. In contrast, both the nuclear extracts and the total ex- 

20 tracts derived from CMVbipepNLS/CI-2A blocked inhibiting the 
protease activity of subtilisin whereas only the total extract 
of CMVbipep/CI-2A exerted CI-2A activity (Fig. 3). Western 
blotting revealed an equal amount of CI-2A in the total ex- 
tracts from CMVbipep/CI-2A and CMVbipepNLS/CI-2A transduced 

25 cells, thereby indicating a similar expression level. Consis- 
tent with the activity test, the amount of CI-2A detected in 
the nuclear extracts by western blotting was at least 10-fold 
higher for NLS/CI-2A than for CI-2A. Thus, two independent 
types of analysis support that the presence of the NLS results 

30 in an increased concentration of NLS/CI-2A in the nucleus. 

Endoplasmic reticulum localization requires targeting to this 
compartment and subsequently an ongoing retention. To achieve 
this, the pCMVbipepER/CI-2A construct contains a secretoric 
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leader (SL) peptide and a retention peptide fused to the N- 
terminal and C-terminal of CI-2A, respectively (Fig. 1 and 
Example 1-d) . To be able to evaluate the functionality of the 
retention signal, the pCMVbipepSL/CI-2A construct that only 
5 contained the secretoric leader peptide was made (Fig. 1 and 
see example 1-d) . NIH-3T3 cells were transduced with these 
retroviral vectors in conjunction with the CMVbipep and 
CMVbipep/CI-2A. These constructs allow investigation of the 
activity of both the leader peptide and the retention signal 

10 by determination of the amounts of CI-2A protein secreted to- 
the cell media. After media exchange, the secreted CI-2A 
protein was detected at different time points by a combined 
immunoprecipitat ion western blot procedure (cf. l-o) . SL/CI-2A 
was detected after 3 hours incubation and significantly in- 

15 creased during the incubation periode. In contrast, ER/CI-2A 
was not detectable before 5.5 hours of incubation and thereaf- 
ter only at a level comparable to that produced by the 
CMVbipep/CI-2A transduced cells. The CI-2A found in the media 
from the CMVbipep/CI-2A and CMVbipepER/CI -2A two cell lines is 

20 therefore likely to be due to cell death instead of active 
secretion. In summary, the presence of the leader peptide 
increased the secretion of CI-2A, but this secretion was 
significantly delayed by addition of the KDEL retention sig- 
nal. The combined data propose that the CI-2A expressed from 

25 CMVb.ipepER/CI-2A becomes translocated to the endoplasmic 
reticulum. 



By defining the subcellular localization of a peptide library, 
the likelihood of isolating active peptides interfering with 
reactions known to be restricted to occur at such locations 
30 can be significantly increased. In addition to the examples 
described above, one could also target CI-2A to other loca- 
tions such as the cell membrane, mitrocondria, lysosomes etc. 



WO 00/05406 



PC T/DK99/00408 



91 

EXAMPLE 4 

D ispl ciyi iiij CI--2A on piuicjo pa rticlos 

The phage display tochnoloqy has since its discovery been used 
extensively for screen inq of peptide 1 ibraries. Phage display 
5 can be used in combination with ("e.LIScreen™ to enrich the 

,~ -;.].- i ; v. - - i: ~ i * : - 

I — l L.:r . ' . ^ ^ i .i_ . » l w ^ . - l. i. ulit: LUli C A L J_ i C L S OX 

who I <■ cells and thereby reduce the number of pept ides needed 
to t>e handled in the biological screening systems. We there-- 
.fore tested the applicability of displaying C I - 2 A at the 

10 surface- of phaqe particles by insertion of CI-2A into the 
pFARbQ phngomid (Fig. A and Johansen et ai . (1995), Protein 
Eng. 10, pp 1063-10e7). The presence of CI-2A on the surface 
of pFAB60/CI-2A derived phage particles was verified using 
both an EL ISA assay and the subtilisin assay (Example l-o & 1- 

15 i). In the ELI SA assay, immobilized rabbit anti CI-2A polyclo- 
nal antibodies retained phage particles derived from 
pFAB60/CI-2A at levels several orders of magnitude higher than 
CI-2A negative phage particles (Fig. 5). Consistent with this 
result, subtilisin activity could only be inhibited by the 

20 pFAB60/CI-2A derived phage particles. These two experiments 
clearly demonstrate that CI-2A can be displayed on the phage 
surface and therefore fulfill the features necessary for a 
scaffold protein in both the phage display and the CellScreen™ 
technologies . 

25 The loop region of CI-2A will be extensively modified in the 
situation where a peptide library is presented by CI-2A. To 
mimic this situation and test whether phage particles presen- 
ting such modified CI-2A proteins could be produced we ex- 
changed 4 amino acids situated in the loop region with a 19- 

30 mer randomly composed peptide, thereby generating pFAB60/CI- 
2A_rc (Fig. A and Example 1-g). When analyzed by ELISA, a 
significant signal was obtained although it was slightly 
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decreased compared to the phage particles carrying the unmodi- 
fied CI-2A (Fig. 5). This variation could be due the presence 
of an important antibody recognition site in the loop region. 
In conclusion, the fact that phage particles displaying the 
5 modified CI-2A generated a signal comparable to that obtained 
for the pFAB60/CI-2A containing phage particles suggests that 
CI-2A can be displayed independently of the amino acid compo- 
sition in the loop region. 

EXAMPLE 5 

10 Constructions of CI-2A presented peptide libraries 

To facilitate the exchange of amino acids situated in the loop 
of CI-2A, recognition sites for Muni and Sail were introduced 
into the CI-2A cDNA in pCMVbipep/CI -2A by silent mutagenesis 
to generate pCMVbipep/muCI-2A (Example 1-e) . The presence of 

15 the cleavage sites enables a non-PCR based library construc- 
tion procedure- In this procedure, a synthetic oligo that 
includes the randomized region is converted into a double 
stranded form before cloning into the Muni/ Sail sites (Fig. 
6) . The feasibility of producing a peptide library using this 

20 procedure was tested by a small scale ligation followed by 

sequencing of a limited number of randomly chosen clones. Out 
of 8 sequenced clones, all contained insertion of the random 
oligo and none of the insertions encoded identical peptides. 
This suggests that that transfer of diversity from the syn- 

25 thetic oligo into a biologically active form is possible using 
this strategy. 

The use of the Muni site to generate peptide libraries limits 
the portion of the loop region in CI-2A that can become sub- 
stituted. To utilize the complete loop region for peptide 
30 presentation it is contemplated to create a complementary 
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peptide library by using the more 5' -located BamUL cleavage 
site situated in pCMVbipep. In this case, the DNA fragment 
that, contains the randomized region can be generated by either 
a non-l'CR based or a PCR based method as outlined in Fig. /. 
b This allows the '--'-border of the randomized region to be 
defined based on theoretical considerations. 

1 " c " L -" v " v - UMlU - L " LU ciuuing strategies allow construction of CI- 
2A presented libraries that diverge in the manner whereby the 
peptides are presented. Such different libraries are likely to 
complement each other regarding interactions with possible 
target molecules. Screening different types of libraries will 
therefore increase the number of possible target molecules 
identi f led . 



10 



EXAMPLE 6 



15 Discussion of CI-2A as scaffold in the CellScreen™ technology 

As demonstrated in Example 2, CI-2A can be expressed in a 
functional form in mammalian cells. Establishment of a func- 
tional system for displaying randomized peptide sequences 
using CI-2A as a scaffold is thus relatively uncomplicated to 

20 envisage. In order to direct the CI-2A scaffold to different 
compartments of the cell, retroviral vectors harboring diffe- 
rent leader sequences have been constructed. The data pre- 
sented in Example 3 illustrates that a defined localization 
for CI-2A can be obtained. Especially the nucleus and the 

25 endoplasmic reticulum are compartments where several specific 
reactions occur, such as transcriptional regulation and recep- 
tor folding. Such processes are obvious targets for peptide 
antagonists. The intracellular tolerance to CI-2A in mammalian 
cells and the capability of CI-2A to translocate to the nu- 

30 cleus and the endoplasmic reticulum makes it reasonable to 
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assume that CI-2A can be targeted to other compartments and 
intracellular organelles if desired. 



CI-2A is an extremely stable protein that has the advantage of 
being small, having no disulfide bridges, no glycosylat ion 
5 sites and a loop of eight amino acids that protrudes from the 
core structure (Macphalen C.A. et al. (1983) J. Mol. Biol. 
168, pp 445-447). It has been shown that insertion of 7, 9, 11 
and 13 residues between the Met40 and Glu41 (corresponding to 
the Met59 and Glu60 in the native molecule) in CI-2A have a - 

10 minimal effect on the stability and folding rates of the 

protein. Moreover, CI-2A has been found to fold through inter- 
actions of key residues in the C-terminal domain of the pro- 
tein, irrespective of the amino acids situated in the loop 
region (Osmark P. et al . (1993) Biochem. 32, 11007-11014, 

15 Ladurner A.G. and Fersht A.R. (1997) J. Mol. Biol. 273, pp 
330-337) . The loop therefore seems to be suitable for the 
insertion of random residues which is the purpose of the 
present invention. As described in example 4, substitution of 
4 amino acids in the loop region with a 19-mer randomly com- 

20 posed peptide did no significantly affect the capability of 

this modified CI-2A to be displayed on the phage surface. This 
result correlates with previous data showing that folding of 
the CI-2A core structure is independent of both the size and 
amino acid sequence of the loop region . 

25 One important feature of the present invention is that the 

peptides are selected based on a biological activity exerted 
inside mammalian cells. This implicates that the stability of 
the applied scaffold most be independent of disulfide bridge 
formation. Since no disulfide bridges are present in the CI-2A 

30 structure the stability and the folding rate of CI-2A must be 
independent of the redox potential of the solvent. The extrac- 
tion of active C1-2A from mammalian cells suggests that CI-2A 
is capable of adopting a native structure in the intracellular 
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environment, which i s t h e ma j o r c 1 eina rid t o a C ellScr e * 3 n™ scaf- 
fold. 

The N - 1 ( e r m . i. n a 1 10 amino a c: i d residue s < . 1 o n o t; h a v c a ny k n o w n 
function for the folding ot C \ ~ 2 A (Do Prat G.iy G. et al. 
5 Proc. Natl. Acad. Sci. 01, pp 10943-10946 and refe- 

r e n o e s h e r e in). As t h e y rn i n h t b o a b 1 e t < > p e r t o r in u n s p e c i £ i c 
i . ; l l ti l l i un uni ui<j s i ; i. eon ■_ n q ruin tar g e t is o I. - 1 1 r o n w o d e c i d e d 
to use the shorter 64 > < • s i d n • ■ version of C.I-.IA. Tn addition, 
the limited sixo increases the accessibility of ihe 
10 1 p t i c ] e / C I - 2 A p rot o i n t o V > \ r i d i n q p o c k e t ■.■ ; w h i ; :\ i rn i q h t i n c r e a s e 
1 1 io number ■ :~> t possible target s an d t h e r e b y t he 1 i k e 1 i h o o d for 
i s o lat ion o f pep t i d< • an t a a on i s t s . 

The ability of CI-2A to be exposed at the surface of phage 
particles was clearly demonstrated by the data presented in 

15 Example 4. The phage display technology allows selection of 
peptides that interacts with any immobilized component (s) . 
This could be crude cell extracts, receptor containing mem- 
branes or partly purified material containing the activity 
against which a peptide antagonist is desired. At present, a 

20 higher diversity can be handled by phage display due to the 
difference in the physical sise of a phage particle and a 
mammalian cell. By reducing the peptide library to a pool 
which only contains peptides that are capable of interacting 
with the potential target molecules the actual diversity that 

25 needs to be handled inside the cells, can be significantly 
reduced 

To be able to isolate the CT-2A scaffold - and thereby the 
target molecule to which it binds - from selected cells, a 
peptide tag will be fused to the N-terininus of the truncated 
30 CI-2A. We are currently considering the following tags: His- 
tag, Strep-tag or FLAG-tag. However, co-immunoprccipitat ion 
using an anti C1-2A antibody can also be used. Alternatively 
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to the biochemical methods, genetic approaches such as yeast 
or mammalian two- or three-hybr id systems will also be applied 
to identify the targets that interacts with the selected 
peptides . 
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CLAIMS 



o f 
[ ) (a) 

10 

(b) 

15 (c) 
(d) 

20 

(e) 



in e t h o (. 1 f o r ' id o n t i 1 y i n cj a n i / ] v 1 v o , i c 1 i v e rn o d u 1. a t o : " o f 
v L t y o f a t ( t r q e t on z y m e , t h o in e 1 1 i o d c : on i p r i s L a q t h c s t: c p s 

proparinq a pool of expression vectors, each vector of 
said pool consiiniin] at least one member tram a Library 
of randomly inodif led nucleotide sequences derived from a 
parent nucleotide sequence encod inq a parent peptide or 
pare n t r i bonu c 1 e i c acid w h i c \ \ m o d i j 1 a t e s t h e t a r q e t e n z y m e 
act ivit y, 

t ransf orminq a population o f subs rant lally ident ical 
ceils with said vectors of said pool so as to obtain 
transformed cells, said substantially identical cells 
beinq ones which harbour the target enzyme, 

eul tuning said transformed cells under conditions 
facilitating expression of said randomly modified nucleo- 
tide sequences, 

examining said transformed cells and isolating transform- 
ed cell (s) wherein the activity of the target enzyme is 
modulated, and 

identifying the modulator by determining said randomly 
modified nucleotide sequence of said vector present in 
cell (s) isolated in step (d) and/or determining the amino 
acid sequence or the ribonucleic acid sequence of the 
expression product encoded by said randomly modified 
nucleotide sequence . 
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2. The method according to claim 1, wherein the randomly modi- 
fied nucleotide sequences consist of 1) an invariable part of 
the parent nucleotide sequence, and 2) random nucleotides. 

3. The method according to claim 2, wherein the invariable 
5 part of the parent nucleotide sequence encodes a scaffold 

portion of the parent peptide or of the parent ribonucleic 
acid which serves to stabilize said polypeptide fragment or 
ribonucleic acid fragment. 

4 . A method for identifying a modulator in the form of a 

10 biologically active polypeptide fragment or ribonucleic acid 
fragment which is capable of detectably modulating, in vivo, a 
phenotypic trait in a cell, the method comprising the steps of 

(a) preparing a pool of expression vectors, each vector of 



15 



said pool containing at least one member from a library 
of randomly modified nucleotide sequences derived from a 
parent nucleotide sequence encoding a parent peptide or 
parent ribonucleic acid which in vivo modulates activity 
of a known enzyme, wherein the randomly modified nucleo- 
tide sequences comprise 



20 



an invariable part encoding a scaffold portion of 
the parent peptide or of the parent ribonucleic 
acid, said scaffold portion serving to stabilize 
said polypeptide fragment or ribonucleic acid frag- 
ment, and 



25 



random nucleotides , 



(b) 



transforming a population of substantially identical 
cells with said vectors of said pool so as to obtain 
transformed cells , 
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(c) culLurincj said transformed cells under conditions 
facilitating expression of said randomly modified nucleo- 
t ide sequences , 

(d) examining said transformed ceils and isolating transform- 
5 ed cell(s) wherein the preselected phenotypic trait is 

modulated thereby indicating that the expression product 

.^.^^ i. LJt j luvju l. y muu.L i iiucieoLiuo sequence is Dio lo- 

gically active, and 

(e) identifying the modulator by determining said randomly 
10 modified nucleotide sequence of said vector present in 

cell(s) isolated in step (d) and/or determining the amino 
acid sequence or the ribonucleic acici sequence of the 
expression product encoded by said randomly modified 
nucleotide sequence . 

15 5. The method according to any of the preceding claims wherein 
the substantially identical cells are prokaryotio cells. 

6. The method according to any one of claims 1-4, wherein the 
substantially identical cells are eukaryotic cells. 

7. The method according to claim 6, wherein the eukaryotic 
20 cells are selected from the group consisting of fungal cells, 

protozoan cells, animal cells, and plant cells. 

8. The method according to claim 7, wherein the animal cells 
are selected from the group consisting of mammalian cells, 
arthropod cells such as insect cells, avian cells, and piscine 

25 cells. 

9. The method according to any of the preceding claims, where- 
in the transformed cells examined in step (d) predominantly 
carries one single copy of the vector. 
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10. The method according to claim 9, wherein transformation 
step (b) is performed under such conditions that the cells 
transformed are predominantly or at most transformed with one 
single vector from said pool, or wherein, prior to carrying 
out step (d) , cells being transformed with more than one 
vector from said pool are substantially excluded from the 
further steps. 



11. The method according to any of the preceding claims, 
wherein the modulator is a peptide. 



10 12. The method according to any of claims 1-10, wherein the 
modulator is a nucleic acid fragment such as an RNA fragment. 

13. The method according to any of the preceding claims, 
wherein the modulator is stable towards proteolytic attack 
and/or is insensitive to a reducing environment. 



15 14. The method according to any one of claims 2-13, wherein 

the random nucleotides are introduced in part(s) of the parent 
nucleotide sequence which encode (s) the active site (s) of the 
parent peptide or parent ribonucleic acid, or the part(s) 
which encode (s) structure (s) interfering with the active 

20 site (s) . 



15. The method according to any one of claims 2-14, wherein 
the invariable part of the nucleotide sequence encodes trun- 
cated parts of the parent peptide or parent ribonucleic acid 
sufficient to maintain stability. 



25 16. The method according to any of claims 2-15, wherein the 
invariable part of the parent nucleotide sequence encodes a 
peptide which is free from disulfide bridges. 
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17. The method according to any one of claims 2-15, wherein 
the invariable part of the parent nucleotide sequence encodes 
a p e p Lid e h a v 1 n g d i s u 1 f i d e b r 1 1 1 q e s . 

1 3 . T h e me t h o d a c c o r d i n q to any o n e o f c 1 a i in s 2-17, w h o rein 
5 the r a n d cm nu c loot i d e s a r e i n t reduced in the f « "» r m of a n i n s cr- 
tion or a substitution into the parent nucleotide sequence, 
op Lioiidi i y in c oiub uidLiun w 1 l h ue 1 e l _l o i i ( s ; ui i.ii e pa it'll l 
nuc 1. <: o t ide sequence . 

19. The method according to claim 18, wherein the number of 

10 random nucleotides which are introduced is in the range from 3 
to about 100. 

20. The method accordinq to any one of claims 2-19, wherein 
the random nucleotides are nucleotide sequences and/or are 
single random nucleotides introduced at specific sites in the 

15 parent nucleotide sequence. 

21. The method according to any one of claims 2-20, wherein 
the random nucleotides are selected from the group consisting 
of 



synthetic, completely random deoxyr ibonucleo tides ; 



20 - 



synthetic random DNA sequences, wherein limitation on 



randomization of some nucleotides is introduced so as to 



25 



limit the number of available sequences and/or to avoid 
undesired stop codons and/or to facilitate introduction 
of post-translational modifications of expressed pepti- 
de (s) ; 



synthetic random DNA sequences as in (1) or (2) coupled 
to a sequence encoding a purification tag; and 
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- CDR encoding nucleotide sequences isolated from a library 

of immune-competent cells raised against an antigen. 

22. The method according to claim 21, wherein the CDR encoding 
nucleotide sequences encode CDR- 3 peptide sequences. 

5 23. The method according to any one of claims 20-22, wherein 
t he random nucleotides are prepared by random codon synthesis 
where defined DNA codons are synthesized in a random order. 

24. The method according to claim 23, wherein the relative 
amount of synthesized codons ensure that all encoded amino 

10 acids will be present with substantially the same frequency in 
the total of encoded polypeptide fragments. 

25. The method according to any one of claims 2-24, wherein 
the random nucleotides are introduced into the expression 
vector by the principle of site directed PCR-mediated mutagen- 



26. The method according to any one of the preceding claims, 
wherein the modulator in vivo reduces or increases K M of the 
target enzyme for at least one substrate. 

27. The method according to any of the preceding claims, 

20 wherein the modulator in vivo reduces or increases V max of the 
target enzyme for at least one substrate. 

28. The method according to any one of the preceding claims, 
wherein the parent peptide or parent ribonucleic acid is an 
inhibitor of activity of the target enzyme. 

25 29. A method according to claim 28, wherein the inhibitor is 
selected from the group consisting of 



15 esis . 
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a BPTI/Kunitz family protease inhibitor, a serpin family 
protease inhibitor, a Kazal family protease inhibitor, a 
soybean trypsin inhibitor (Kunitz) family protease inhibitor, 
a potato > inhibitor I family member, a Bowman-Birk family 
5 protease' inhibitor, a squash inhibitor family member, a 

wap-type ' Four-di su 1 f ide Core' proteinase i nhibi tor, a hirudin 
family protease inhibitor, a factor Xa inhibitor, an Ascaris 
.it'-—'' -.w^. « ^ ... « ^ ... — " j Li "-je : L '' J i (inn. i y piuuedbu 

inhibitor, a calpain family cysteine protease inhibitor, a 

10 tissue inhibitor of metal loprot einases family member, a 
carboxypeptidase A inhibitor, a metal .1 oca rboxypept idase 
inhibitor, an angiotensin-convertinq enzyme inhibitor, a 
cereal a lpha-amylase/ trypsin inhibitor family member, an 
alpha-amylase/trypsin inhibitor homoloqous to thaumatin, an 

15 alpha-amylase/subtilisin inhibitor family member, an inhibi- 
tors of insect alpha-amylases, an inhibitor of mammalian 
alpha-amylases derived from Streptomyces species, a trehalase 
inhibitor, a polygalacturonase inhibitor, a f ucosylt ransf erase 
inhibitor, a protein kinase C inhibitor, 

20 an cAMP-dependent protein kinase inhibitor, a cyclic nucleo- 
tide phosphodiesterase inhibitor, a protein phosphatase 
inhibitor, a TCD/MRS6 family GDP dissociation inhibitor, an 
ATPase inhibitor, a phospholipase A2 inhibitory protein, a 
ribonuclease inhibitor, an RNA polymerase inhibitor, a DNA- 

25 entry nuclease inhibitor, and a beta-lact amase inhibitor. 

30. The method according to any of the preceding claims where- 
in the substantially identical cells are mammalian cells and 
the vector is selected from the group consisting of a retrovi- 
ral vector, a vaccinia virus vector, an adenoviral vector, an 
30 adeno associated virus (AAV) vector, a herpes simplex virus 
(HSV) vector, an alpha virus vector, and a semliki forest 
virus vector. 
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31. The method according to claim 30, wherein the vector is 
ret roviral . 

32. The method according to claim 31, wherein the retroviral 
vector is derived from retrovirus selected from the group con- 

5 sisting of Avian Leukosis-Sarcoma Virus (ALSV) , Mammalian type 
C, Mammalian type B, and Lentivirus, and optionally modified 
with heterologous cis-acting elements. 

33. The method according to claim 31 or 32, wherein the 
retroviral vector has non-identical ends. 

10 34. The method according to claim 33, wherein the non-identi- 
cal ends contain non-identical promoters. 

35. The method according to any one of claims 31-34, wherein 
the retroviral vector contains a heterologous promoter repla- 
cing the viral promoter in the S'-LTR, such as a CMV promoter, 

15 an RSV promoter, an SV-40 promoter, a TK promoter, an MT 
promoter, or an inducible system such as Tet or Ecdysone. 

36. The method according to any one of claims 31-35, wherein 
step (a) is carried out by 

1) transfecting suitable packaging cells with vectors which 
20 comprise the randomly modified nucleotide sequences and 



which are integratable in virions produced by said pack- 
aging cells, 



2) 



culturing said transfected packaging cells in a culture 
medium under conditions which facilitate production by 
the packaging cells of virions containing the randomly 
modified nucleotide sequences, 



25 



3) 



recovering and optionally concentrating said virions, and 
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transducing said substantially identical cells with the 



vir ions . 



37. The method according to claim 36, wherein the packaging 
ceils are selected from the group consisting of PE501, Bosc23, 

5 H'2, G P f EO 6 , PhoenixEco, PA317, GP fAM12 , DA(ampho), Ring, 

FLY A 1 3 , ProPak, CRIP, WAM, Phoen ix-Ampho , PG13, H9 (293GPG), 
a no n. co t' a c k. . 

38. The method according to any one of claims 31-37 , wherein 
the virions are pseudotyped retrovirus produced by an ecotro- 

10 pic packaging cell line so as to confer broad tropism to the 
virions produced thereby, or wherein an ecotropic receptor has 
been introduced into the substantially identical cells so as 
to allow transduction with ecotropic virions. 

39. The method according to claim 38, wherein the ecotropic 
15 receptor has been introduced in the substantially identical 

cells by means of transduction. 

40. The method according to any of the preceding claims where- 
in the randomly modified nucleotide sequences are coupled to a 
nucleotide sequence encoding at least one fusion partner. 

20 41. The method according to claim 40, wherein the fusion 
partner serves to facilitate expression and/or purifica- 
tion/isolation and/or further stabilization of the expression 
product . 

42. The method according to claim 41, wherein the fusion 
25 partner includes a purification tag such as His6 tag, myc tag, 
BSP biotinylation target sequence, of BirA, flu tag, lacZ, and 
GST . 
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43. The method according to claim 40 or 41, wherein the fusion 
partner is a sorting signal or a targeting sequence. 

44. The method according to claim 43, wherein the sorting 
signal is a signal patch or a signal peptide. 

5 45. The method according to claim 43 or 44, wherein the sor- 
ting signal effects export of the expressed peptide out of the 
cell or into the cell membrane, or, when the substantially 
identical cells are eukaryotic, into endoplasmic reticulum, . 
into Golgi apparatus, into lysosomes, into secretory vesicles, 
10 into mitochondria, into peroxisomes, or into the nucleus. 

46. The method according to any one of the preceding claims, 
which further comprises the step of resolving the 3 -dimen- 
sional structure of the identified modulator. 

47. A method for the preparation of a replicable expression 
15 vector, the method comprising the steps of identifying a 

modulator by the method according to any one of claims 1-46, 
and subsequently 

i) isolating or synthesizing a nucleic acid sequence which 



20 ii) engineering a replicable expression vector comprising an 



encodes the modulator, and 



25 



operon which comprises, in the 5' -3' direction and in 
operable linkage, 1) a promoter for driving expression of 
the nucleic acid sequence, 2) optionally a nucleotide 
sequence encoding a leader peptide, 3) the nucleic acid 
sequence, and 4) optionally a termination signal. 



48. A method for the preparation of a transformed cell carry- 
ing a nucleic acid sequence encoding a modulator as defined in 
any one of claims 1-46, the method comprising transforming a 
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suitable host cell with an expression vector prepared accor- 
ding to claim 4 7 . 

49. A method for providing a modulator as defined in any one 
of claims 1-4 6, the method comprising 

5 I) growing a transformed cell prepared according to the 

method ui claim <i o m _ x culture medium under conditions 
which facilitates expression by the ceil of the randomly 
modified nucleotide sequence, and 
II) subsequently harvesting the expression product from the 

cell and/or the culture medium, or 
la) identifying the modulator according to the method of any 

one of claims 1-4 6, and 
lb) subsequently synthesizing the modulator by means of 

chemical synthesis on the basis of the sequence deter- 
15 mined in step (e) . 



10 



50. A method for isolating and/or identifying a target 
biomolecule, the method comprising providing a modulator 
according to the method of claim 49 and subsequently using the 
modulator as an affinity ligand in an affinity purification 
20 step so as to isolate the target biomolecule from a suitable 
sample . 



51. The method according to claim 50, wherein the affinity 
purification step is an affinity chromatographic step, an 
affinity mass spectrometry step, or a co-immunoprecipitation 

25 step. 

52. A method for isolating and/or identifying a target 
biomolecule, the method comprising providing a peptide modula- 
tor according to the method of claim 48 and subsequently using 
the modulator as a probe against a cDNA library derived from 



WO 00/05406 





CT/DK99/00408 



108 

the substantially identical cells or using the modulator as 
bait in a two- or three-hybrid system. 

53. The method according to any of claims 50-52, wherein the 
target biomolecule is a peptide or a nucleic acid. 

5 54. The method according to claim 53 further comprising the 
step of determining the amino acid sequence of the peptide or 
determining the nucleotide sequence of the nucleic acid. 

55. The method according to any of claims 50-54, further 
comprising the step of resolving the 3-dimensional structure 

10 of the target biomolecule. 

56. A method for selecting a chemical compound as a putative 
drug candidate in drug development, the method comprising the 
steps of 



according to the method of any one of claims 50-55, and 
selecting compounds which interact significantly with the 
target biomolecule . 

57. The method according to claim 56, wherein the library of 
20 chemical compounds has been provided by chemical synthesis 

upon initial identification of the members of the library by 
structure-based or non-structure based computer drug-model- 
ling . 

58. A method for the preparation of a medicinal product, the 
25 method comprising the steps of 



15 



assaying a library of chemical compounds for interaction 
with a target biomolecule which has been de novo isolated 



A) selecting a chemical compound by the method according to 
claim 56 or 57, 
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B ) per f o rmin g p r e -clinical t e s t s with the chemical c omp ound 
in order to assess the su i tab i 1 i t y thereof as a medi cinal 
P roeluct; , 

C ) enter ing , if the c hemic a 1 compound i s deemed suit abl e in 
s t. ' p ( B) , cl i n i on 1 t r i a Is us i ng the chomi ca I i:;<)mp>onnd in 
■ a < io r to obtain market aut h< a i' i za t ion 1 '3 r a medicinal 

or o d u c t in c 1 u d i n q t h e 1 e a d c omp o u n d a s a p h a r m a c e u t i c a 1 1 y 

a C l ± V O S U U l a nee, an (J 

D) upon a rant, of a market authorization, a dm i xing the chemi- 
cal compound with a pha rmaceu t i ca 1 1 y acceptable carrier, 
excipient cor diluent and marketing the thus obtained 
medic i na L produc t . 

59. A method for developing a medicinal product, the method 
comprising that a modulator identified according to the method 
of any -one of claims 1-4 6 serves as a lead compound in the 
drug development phase or wherein a target biomolecule iso- 
lated/identified according to any one of claims 50-55 serves 
as an interaction probe for the identification of putative 
drug candidates in the drug discovery phase. 



* 
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JC07 Rec'd PCFPTO 
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SEQUENCE LIGTTNC 



Ha 1 k i e r , Tor ben 
.'If e; per son , Lone 
Jensen, Allan 

• 1 ;s: • Novel Met heels for the T el en t i 1 i ea t i on oi" Liqanci and 
T a r c }( ■ t 13 l omo 1 e c u 1 o s 

« • ••• ' • ; • I i EC 1 

v I •! 1 : 

•. 1 m) A \ 

<17iV;- Patent in Ver. 2.1 



■ 23 1 • inisc feature 

• 222 ■ (2497 

-22 H ■ G is G or T 
-220 • 

^221 misc^feature 

■222 • (252) 

-.22 3 - A is A or C 

-.220 • 

-221- m i so_ feature 

■222- (279) 

•223- C is C or T 



< 21 ] o 4 51 

- 2 1 . : > DN A 



Hordeum vulgarc 



CPS 

(85) . . (339) 



- 2 20 • 

2 2 I • ma t pept ide 
-.222 • (88) . . (336) 



-■400*> 1 

cattaaactg atgacatgac agttcaagat ctcacagtca categgegat ctaatcagtc 60 
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t c a c a q q a a y cqaqcgtaac a a q q a t. q a q t t c a g t g q a q a a q a a c j c c q c j a q 111 

Met 5 e r S o r V a 1 G 1 u I . y s L y s I ' r o G 1 u 
- 1 1 5 

q c } a q t q a a c a c * c q q t: q o t . q q t q a e c q t c a c a a o c t". y a a q < i o a < j a q t q c } 15 9 
Gly Val Asn Thr Gly ALi Gly Asp Arq His Asn Lou Lys Th r: G La Trp 
1 0 1 f ; 7 0 

co a c j a q t t q q t q q q < j a a a tcq q t q q a q g a q q c c a a q a a q q t: q a 1 t c t q 2 0 7 
Pro Glu Lou Val Gly Lys Ser Va 1 Glu Glu Ala Lys Lys Val 1 Lc Leu 

2a 30 35 4 0 

caq a a < : a a q < ; c : a q a • j g c g c a a at c a t a g 1 1 o t q c c q q t q q q g , i c a a t t 2 5 5 
Gin Asp Lys Pro Glsu Ala Gin lie lie Val Lou Pro Val Gly Thr lie 
4 5 50 5 5 

q t: q cicc a t q qaa L a t c q q a t c q a c t : g o q t c c g c c t c t 1 1: qtc q a t a a a 30 3 
Val Thr Mot Glu Tyr Arq Ilo Asp Arg Val Arq Leu Pho Val Asp Lys 
GO 6 5 7 0 

c t c q a c a a c : a 1 1 q c o c a q y t c c c a a g q q t c g g o t a q c a c i q o t: L qa q 3 4 9 

Lou Asp Asn lie Ala Gin Val Pro Arg Val Gly 
75 80 

aqctagoctg ctqctqqcgt gtatgtattg cagcttcacc atctcttctt ggctatagca 409 

aqatt;qagat ttataaatca tatacaataa gagttgctgc gq 451 



•:::io> 2 
0211;- 84 
•:212> PRT 

•:213o Hordeuin vulqare 
-:4 00 - 2 

Met Ser Ser Val Glu Lys Lys Pro Glu Gly Val Asn Thr Gly Ala Gly 
15 10 15 

Asp Arq His Asn Leu Lys Thr Glu Trp Pro Glu Leu Val Gly Lys Ser 
20 25 30 

Val Glu Glu Ala Lys Lys Val lie Leu Gin Asp Lys Pro Glu Ala Gin 
35 4 0 4 5 

lie lie Val Leu Pro Val Gly Thr lie Val Thr Met Glu Tyr Arq lie 
50 55 60 

Asp Arq Val Arg Leu Phe Val Asp Lys Leu Asp Asn lie Ala Gin Val 
65 70 75 80 



Pro Arg Val Gly 
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3.1 tV \ 
31 i S7 

.: i s r-rJA 

S M ■ A r t I ( i m . i ) . l > o q u o n c e 

sso - 

S3 i 1'^scr.ipt ion of Artificial Soquonce: Synt hot. 'k' DNA 
\ 1 r i rne r 

■10.) • •: 

q iq itceut q.Kjq,u.Mgi;q qcc .:, i c j , i q 77 



ss i I ss 
SNA 

■■J. 1 - Artificial Sequence 
: S3 < ) • 

■ Inscription of Artificial Sequence: Synthetic DNA 
} -r imer 

■ .4i)ic ■ 4 

C'.iotoqaqtc agccqaccct ggggacct 28 



•.i'ln •• 3 

- Si I SI 

- 313s SNA 

SSI 3s Artificial Sequence 
oS Ssis 

v333> Description of Artificial Sequence: Synthetic DNA 
primer 

•Ol 0 OS s 

ctgtatetgg cggctccgtg g 21 



- 210> h 
-■211s is 
Ol2^ DNA 

••21 3 " Artificial Sequence 

v.;j;:o> 

<::::3> Description of Artificial Sequence: Synthetic DNA 
primer 

--■100 • 6 

aoagotggee ctcgcagac 29 



210> 7 
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: i : • ;:o 
] :•• • [>:ja 

' 1 ' v • /\ r t i f i o i a 1 S e q t i o n o o 

' ! 3 ~ - Poo, c r i p t I on o t A r t i 1 i o i a 1 Sequence: S y n t h o t" i o ON A 
pr i rno r 

*.()():■ 7 

:< :a< * t not t net gqet t a t 



'10 s 

A: ! 1 17) 

■:.a:a I iNA 

. ' 1 • A r t i f i c lal S e q u e n c; e 



Inscription of Art i ticial Sequence: Synthetic DNA 
f. • rimer 

• : ■! nTr - h 

tqqqqctgca cgtcatttg 19 



• 2 i. 0 • '< 

•:.:i.r- :-o 
212 • dna 

•21 3 • Artificial Sequence 

• ;•:•:() • 

-■'22'S'- Description of Artificial Sequence: Synthetic DNA 
t 'rimer 

MOO ' 

tqtgotacgg cgagtttggt 20 



7210- 10 

7211- 20 
■:212 > ONA 

-■2! 1.3* Artificial Sequence 
- 220 > 

•".223~> Description of Artificial Sequence: Synthetic DNA 
primer 

<.100> 10 

ggttcgtgaa aggctccatt 20 



310 > 11 

211. > 22 
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< 2i2> nriA 

• 2 1 3 : • A r t i f i c: i a 1 .So c \u o n c e 

- 220:- 

- 22 3> Description of Artificial SoqiKmci 1 : Synthot. ic DNA 

p £" i mo r 

' -if!0> .1 I 

< : . i , i a 1 : cj t t c,i ca.it. t a q l: cc t q 2 2 

■210* ] 2 

• 2 11 ■ (.0 

- 2 12 ' DIJA 

■ 2 1 3 ■ A r t i f i c i a 1 S o quon ce 

- 2 20 

■ 2 2 ?■ ■ Do s c r .i p t i o n o t" A r t i i i c i a 1 S e q u o n c; c : 2 y n t h otic DN A 

p r i mo r 

• 100 • 12 

<} jan. i to tat ggcqqccqca ecaaaaaaya aqaqaaaqqt aqqatccatg aagacagaqt 60 

v210> 13 
•211* 28 
-■.212^ DMA 

■:213^ Artificial So quo nee 

• 220 - 

o223^- Description of Artificial Sequence: Synthetic DNA 
primer 

—100- 13 

oyctcgagtc agccgaccct ggggacct 28 

--2Kr> 14 
<211> 30 
-.212 * DNA 

^213" Arti f icial Sequence 
<220* 

^22'.*> Description of Artificial Sequence: Synthetic DNA 
primer 

•-I00-- 14 

qaaqatctat ggactggatc tggcgcatcc 30 

<210> 15 
•:211 > 28 
^212 > DNA 
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<713> Art ilicial Sequence 

• 770 • 

• . : .' ' ■ nr ,( u.' r i pt ion o L Ar I i f i cia i Sequ^nc : Syn f her i o ON A 

p r i iaer 

—)()()■ 1 7 

c M q c j v i t a • r. , i ( f < i a t q a q c q < ; c : q ( ] t. a q c a q 2 8 



• 210 J 0 

• .•17 7NA 

•71 7 A r I i f i cial Sequence 
•.;7! 

- .77 7 ' • f'f 'scr i pt ion ol Ar ti 1 i cia 1 Sequence : Syn t he t i c DNA 
f-r i.mor 

• liir ■ 1 <■ 

el q*, ,it:ct qq cqqctccgtq q 21 



•;.:ln • 17 

• - 2 1 1 7 5 5 

• 7 17- r»riA 

•7:1 "7- Artificial Sequence 
-.270 

•7:2> * Description of Artificial Sequence: Synthetic DNA 
primer 

7.100 - 17 

ctaatctaga ctacagctcg tccttgtagt cctcgaggcc gaccctgggg acctg 55 



7210- 18 
-2711 ■• 7 C * 

7712 - dna 

7 2 17- Artificial Sequence 
<22 0:- 

•223 • Description of Artificial Sequence: Synthetic DNA 
} '.rimer 

■■■M00*- 18 

eggqatccat gaagacagagtggccagag 29 



7210 > 19 
<211 - 20 
7217 > DNA 

Artificial Sequence 
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< 220 . 

' 2 :> r [ i e a c r i p t i. o n o 1 A r t; i 1 i c i. a 1 S oquonco : S y n t h ( ? t i c ON A 
I > r i mo i 

< iocr i o 

('■qqrc: tat. t: go i a q cq qc 1' 0 



< '10 a 2 0 

• all. a 8 

< .. 12 DMA 

• .'13. Arti fioial Sequonci 1 

• a/0 • 

- ._' a 1 . J ■ 1 k ? a c r i p t i o n o I A r t 1 f i c i a 1 S o qu e n a o : S y a t. hot i a ON A 

p r i rno r 

- •iHC ■ .:() 

c a q a ■ g a t . g q c ] t a c a a 1 t q t a a c o a t g g 2 8 



• a" 10 • a 1 

• 2 1 1 • 2 1 

• 212 ■ PNA 

• .'la ■ Artificial Sequence 
■ 220 * 

• J J 2 ■ Description of Artificial Sequence: Synthetic DNA 

primer 

- 100 - 21 

ot.qtatctgg cggctccgtg g 21 



-,21D • 22 
■ 211. • 21 
-212 - DNA 

- 212 ■ Artificial Sequence 
< 220 > 

* 222 • Description of Artificial Sequence: Synthetic DNA 
primer 

-4 00- 22 

ctqtatctgg cggctccgtg g . 21 



<210"> 23 
•■.211 > 4 4 
-:212 > DNA 

■ 213 > Artificial Sequence 



220:> 
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• " 2 3 > Do ::. cription of Art i. 1 i c L a 1 f I c q u en c o : Synthetic DN A 
[■r into r 

■ r«n-- ; •• 

< ■ i . t q 1 1 1 q t o q acaaaq a q q c: q q a c q cq a t c q a t q c q a t: a 1 1 c c 4 4 



■;0> .-1 

11- : o 
: • i-ma 

' 1 • ; /• r t i f" i t * i . j 1 Sequence 
:20e 

2 3'- 1. 1 o s c r: i \z t i oi i o i A r t i t' i c i a 1 S oqu o n c e : Synthet i c DN A 
j ■ r i mo r 

!ik:> 21 

•i|(icct tat tcc^Kjcqqc 2 0 



21ne /3 

■ i ] ; - -0 

• I : i •ma 

• 2 1 3: Artificial Sequence 

■ 2 2 ( i 

•22 V- ['oscr.ipt.ion of Artificial Sequence: Synthetic DNA 
r 'rimer 

• Itm; • ,;S 

c c q = -} e c r. t. at. tccaagcggc 2 0 



•-2 10- 2 6 

• 2 1 L - .": i 
-.212^ UNA 

• 213 ■ Artificial Sequence 
-:220- 

•223"> Description of Artificial Sequence: Synthetic DNA 
primer 

<ioo-> 26 

ctqtatctgg cggctccgtq g 21 



• 2llT> 27 
v211^ 54 
•-212- DNA 

• 213*- Artificial Sequence 



220 > 

22 3_> Description of Artificial Sequence: Synthetic DNA 
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p r i mo r 
olOO^ 21 

a::, t.t.cctaqc t cjc, ica,i« -ca qcMatqqcai' L cjaoqacaqa it q a • m q ( 1 q } 5-1 



o i o: 28 

<:' l ! w 

:;■ l 2: i )NA 

■ \ : } ■ ArtilicuaL Sequence 
< .':'(! 

•'■ Description of Artificial Sequence: Syn t he t. .1 c DNA 
{ >r ime r 

<-iuu2 2R 

a;.,i<njaatqc qqcc q c q c o f j a c c c t q g q q a c c t q q q o 3 7 



i.a;- 2 9 
l:- 17 
DMA 

<:2 1 . .•;> Artificial So ^uence 
< :22 0> 

^ ' 2 2 3 " • Do scription of Artificial 
primer 

- 4CH.r- 29 

i.aioai:aggaa act a tqa 



Sequence: Synthetic DNA 



17 



*■■'."' 1 1.):'- 30 
•ill - lis 

DNA 

-a2 1 Artificial Sequence 

-:32 3:* Description of Artificial Sequence: Synthetic DNA 
primer 

•■■MOO** 30 

ctgecggtgg gtacaattqt gctgcgctac atggaccgcg caataqtgat gaacgtgaac 60 
attaqcgcac gcaaactacg gattgatcgc gtccgcctct ttgtcgacaa actcg 115 



O10 - 31 

<2ll^ 25 

•:?.12'> DNA 

•■:."-! 1 3 '•■ Artificial 



Sequence 



220 > 

323 > Description of Artificial Sequence: Synthetic DNA 
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■■lu()> 31 

■gaqittqtc qacaaagaqq cgqac 



* . • o - 

* 3 ir 20 

- .' 12: r»I3A 

* \ Artificial Sequence 



• 230:- 

- 2 3 3'- Do < ; t ; r: i p 1 1 on o f A r t i £' i c i a i S e qu o n ce : 3 yn t. he t i c DNA 

p r irno r 

- -1;hi:- 32 

1 i ■ r . < i c c q 1 1 1 q q r| t a c a a t: t: q 2 0 



3i0> 3 3 

..'11- 8 3 

.2 12 PITA 

'2 13- Artificial Sequence 



•■223'* Description of Artificial Sequence : Synthetic 
do generate ol igonucleotide 

- .10 0.* 33 

t.ctqocggtg ggtagaattc nnnnknnknn knnknnknnk nnknnkcgga ttgatcgcgt 60 
■ -cgcctct: 1 1 gtcgacaaac teg 8 3 



-.210** 3-1 
• 2 1 1 - 2 5 
v2l2> DNA 

<213^ Artificial Sequence 
*.220* 

- 223~> Description of Artificial Sequence: Synthetic DNA 
primer 

<A00> 34 

cqagtttgtc gacaaagagg eggae 25 



x210> 35 
<211> 12 
•-'21 .'!'■> PRT 

<213> Artificial Sequence 



^320"> 

v223> Description of Artificial Sequence: Fragment 



# 



• 
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c o nst i t u t inq n u c 1 e a i: 1 o c a 1 L z at. l t > n : ; i q n a L 



•']Q0> 35 

M-t AIj Ala Pro Lys Lys 
1 5 




10 



•5 1 p - 3 6 
-:21 I.- 21 

■ 2 1 ' 3 • A r t. i f i c i a 1 S o c \ u e n c o 
-:220 • 

• ; 2 2 3 - Do s c r i p t i on of Ar t i f i c i a 1. S o q u o r i c o : Fr a qme n t 
cons t i t ut 1 nq secretion .sic; rial 

•MOO > 3 6 

Hot Asp Trp llo Trp Arq Ilo Leu Phe Leu Va 1 Gly Ala Ala Thr Gly 



- 2J 0 - 37 
•21 1 -» 8 
-2120 FRT 

•:213> Artificial Sequence 
•'220:* 

-.223 - Description of Artificial Sequence: Fragment 
constituting retention sequence 

<4 00> 37 

Leu Glu Asp Tyr Lys Asp Glu Leu 



<210> 33 
o211> 2 3 
<2\2'> PRT 

<213> Artificial Sequence 
-:220-> 

<223^ Description of Artificial Sequence: Fragment 
constituting random insert 



10 



Ala Lis Ser Gly Ser 
20 



1 



•:400^ 38 

Val Leu Arq Tyr Met Asp Arg Ala lie Val Met Asn Val Asn lie Ser 
15 10 15 
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Ala Arq Ly:j hew Arq lie Asp 

2 0 



' 1 0: 

Ml. 

: 1 2: 



3 9 

r .7 8H 

ON A 



'1 Y> Art; : i i.c 



>equen i:;o 



cnp'. Ion ot Artificial Sequence: Hybrid 
eu 1 a r t.> 1 a. "in Ld 



tcqcgcgt tt 
caqct tqtct 

tT. qqoqqqtq 

accata t:qcq 
a t -.cqccU t 
tacgccagc t. 
tr tcccagtc 
qaq cccg tta 
c< ^ittgacgt 
cgtcaa tggg 
at (j cc.aagta 
ca gt aca tg a 
at taccatgg 

CqqggatttC 

caacgggac t 
cqtgtacggt 
cgatagactg 
aatcgtggtc 
gggtctt t ca 
ca ecgtcggg 
tgtgtgtgtg 
aetagatc tg 
ccctgggaga 
cgcggccgcc 
ctcggaga tc 
a aggccgg tg 
tgagggcccg 
tcgccaaagg 
cttgaagaca 
acaggtgcct 
cccagtgcca 
tat tcaacaa 
ggcctcggtg 
gaaccacggg 
cgagcgaccc 
ctccggccgc 
gctctgatgc 
ccgacctgt c 



cgqt qatgac 
gtaagcgga t 
-cqqggc tgg 
qtqtqaaa ta 
caggctgcgc 
gqcqaaaqgq 
a cgacgt tgt 
ca taact tac 
caataatgac 
t gqa gta t tt 
cgocccctat 
cct tatgqga 
tgatgcggtt 
caagtctcca 
ttccaaaatg 
ggqaggtcta 
agtcgcccgg 
tcgctgatcc 
1 1 tgggggct 
aggtaagctg 
tgccggcatc 
ta tctggcgg 
cgtctcagag 
atgtagtcta 
tgggccca tg 
tgcgtttgtc 
gaaacctggc 
aa tgcaagg t 
aacaacgtct 
ctgcggccaa 
cgttgtgagt 
ggggctgaag 
cacatgcttt 
gacgtggttt 
tgcagccaat 
ttgggtggag 
cgccgtgttc 
cggtgccctg 



qq t. qaaaacc 
gccqggagca 
c t taact atg 
ccgcacagat 
aactgttggq 
qqa tgtgctq 
aaaacgacgg 
ggtaaatggc 
gtatgttccc 
acggtaaact 
tga cgtcaat 
ctttcctact 
t tggcagtac 
ccccat tgac 
t eg taacaac 
t a a a a agggt 
gta cccgtgt 
t tgggagggt 
egtcegggat 
gccagcgatc 
tactttttgc 
c tccgtggaa 
qca teggggg 
gaacgcgttg 
cggccgcccc 
tatatgttat 
cct gtcttct 
ctgttgaatg 
gtagcgaccc 
aagccacgtg 
tggatagttg 
gat geccaga 
acatgtgttt 
t cct t tgaaa 
atgggatcgg 
aggctat teg 
cggctgtcag 
aatgaactgc 



tctqaoaca t 
gacaagcccg 
;:ggca teaqa 
gegtaaggag 
aaggqeqate 
:aaggcga 1 1 
:cagtgaat t 
ccgcctggct 
atagt aacgc 
gcccacttgg 
gaegg taaat 
tggcagt sea 
at caa tgggc 
gteaatggga 
tccgccccat 
aagaacccc a 
at ccaa taaa 
ctcct cagag 
ttggagaccc 
gttttgtctc 
gcctgcgtct 
gaactgacga 
ggggat ccag 
atcagt taac 
etaaegttae 
t t t ccaccat 
t gacgagcat 
tcgtgaagga 
t t tgeaggea 
tataagatac 
tggaaagagt 
aggtacccca 
agtcgaggt t 
aacacgattg 
ccattgaaca 
gctatgactg 
cgcaggggcg 
aggacgaggc 



gca get cccg 
tcaqqqcgcg 
gca ga t tqta 
a a c aa t acegc 
qqtqegggcc 
aacjttgggta 
etcegg aa 1 1 
ga ccge ccaa 
caat agggae 
cagtaca tea 
ggcccgcctg 
tctaeg tat t 
gtggata gcg 
gtttgttttg 
tgacgcaaat 
cacteggcgc 
gect 1 1 tget 
tgat tgactg 
ccgcccaggg 
cgtc tctgt c 
gatt ctgtac 
gt tegtatt c 
agct cgagct 
gaattcgaag 
tggecgaage 
attgeegtet 
tcctaggggt 
agcagt tcct 
gcggaacccc 
acctgeaaag 
caaatggctc 
ttgtatggga 
aaaaaacgt c 
ccgcgtgtgg 
aga tggattg 
ggcacaacag 
cccggt tct t 
agegeggcta 



gaga egg tea 
teagegggtg 
c t q a g a g t g c 
atcaggcgcc 
tettegctat 
aegecagggt 
ggctagecta 
cgacccccgc 
tt tccat tga 
agtgtatcat 
gcat tatgee 
agtcatcget 
gtttgactca 
gcaccaaaat 
gggeggtagg 
gccagtcctc 
gt tgcatccg 
cccagcctgg 
accaccgacc 
tttgtgcgtg 
tagttagcta 
ccgaccgcag 
t tgaaaaaca 
ggtcccaggc 
cgcttggaat 
tttggcaatg 
ct t tcccctc 
ctggaagctt 
ccacctggcg 
gcggcacaac 
tcctcaagcg 
tctgatctgg 
taggcccccc 
cctcgaacac 
caegcaggtt 
acaategget 
tttgtcaaga 
teg tggctgg 
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ccacqacy g c j c g 1 t cct t go g c a ( 3 c t g t g c 
q g c t q c r a t t; q q q <: ; q aaqtq c c g g g q c a q < j 
a q a a a q t.a.tr c a t < : a t q q c t q a t q c a a t q< • 
(]CCCii t f • •( i i cca coa aqcq a.iaca tcqca 
q t c t t < q L c c j a t: c a g q , a L g a r. c t q q acq a a q 
r - ( j e c a q q c t . - a a g q < : q ■■ q < : a t; q < : c c g acq 
cot: act: f qcc a -.a t a ' c i • a q t. a q a a a a r - j 
eac; qq H qt ?q."qqao- j<- tateaggaca 
aqet t.qq:qa • : q aat g. 1 q - t a a ccqcttc c 
cgcagcgear. cgc a t t. c r_ a t. cgcettcttg 
at a . 1 . a ^ , i aiaiaai a a a a a a r r t; t a :_ 
acca: t r. ca t. aagqc t.taqc cage t a act q 
acaccaq.jcjc : aat a :c < e r igaaaaacaa 
accg-jg acta qggeco i.jc.i ggatatctgt 
aag aacaga t gg t ccccaqa aacagagagg 
g a t; .a t c t g t g g tea .3 g c a c r_ a g g g c c c c g g 
a t. a ac i. i.i a 1 caaoaaca'^t t tcaagagac 
cegggqatea accccaagcc rcatttaaac 
cccgcgctta Ltgctgccca qctcta taa i 
gtcctccgat agactgagtc gcccqgqtac 
catccgaatc gtqqtctcgc tgatccttgg 
aggcatgeaa gcttggcgta atcatggtca 
ctcacaattc cacacaacat aegagcegga 
tgagtgagct aacccacart a a ttgcgttg 
ctgtcgtgcc agetgeatta atgaategge 
gggegctett ccgcttcctc gctcactgac 
gcggtatcag ctcactcaaa ggeggtaata 
ggaaagaaca tgtgagcaaa aggecagcaa 
ctggcgtttt tccataggct ccgcccccct 
cagaggtggc gaaacccgac aggactataa 
ctcgtgcgct ctcctgttcc gaccctgccg 
tegggaageg tggegcttte teaatgetea 
gttcgctcca agctgggctg tgtgcacgaa 
teeggtaact ategtcttga gtccaacccg 
gccactggta acaggattag cagagegagg 
tggtggccta actaeggcta cactagaagg 
ccagttacct teggaaaaag agttggtagc 
agcggtggtt tttttgtttg caagcagcag 
gatcctttga tcttttctac ggggtctgac 
attttggtca tgagattatc aaaaaggatc 
agttttaaat caatctaaag tatatatgag 
atcagtgagg cacctatctc agegatctgt 
cccgtcgtgt agataactac gataegggag 
ataccgegag acccacgctc accggctcca 
agggecgage gcagaagtgg tcctgcaact 
tgccgggaag ctagagtaag tagttcgeca 
gctacaggca tcgtggtgtc acgctcgtcg 
caacgatcaa ggcgagttac atgatccccc 
ggtcctccga tcgttgtcag aagtaagttg 
geactgeata attctcttac tgtcatgeca 
tactcaacca agtcattctg agaatagtgt 
teaataeggg ataatacege gccacatagc 
cgt t ct t egg ggegaaaact ct caaggatc 
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tccj a c q 1 1 q t; ct c t q a . 1 q c q c j j a a a q qa, : t 2 ''■ 4 0 

atctectqte atctcacctt; gctcctijcrc; 2 4(10 

yqoggctgea t. a cgc 1. 1 ga t ccggctacet 2 A t>0 

•• <~ q a q -qa a c a c qtact:q(j a t q q a a a c q 7 «77 

age a t. ca qgg gctcqcqcca gecqaactgt 22:- HO 

(jcqaqqat-pt; cgtcgtgacc catqacqatg 7 > 4 < ) 

gecqct t t t c Lggat tea r. c gactqtqqcc I! 7 in') 

tagcgttgge t a eeeg '.: ga t attqetqaaq 7 7 60 

tcqt g : { t t a cgqLatcqcc qct eeegat t 71-70 

a :.:qaq 1 1 ctt ct qao r taag acaitaqaaq 7 caa 

t can ! tt >ca a a \iaaa r ! eea '*t aiaae '''t-1 n 

e. aq La acq ee attttqeaaq gc. 1 t qqa id < a » a a 

gaa- iaq« jaa gtacaqagaq qctggaaaqt a a , ( > 

ggtcaagcac t ig jg.-cccg qec^agggae 7 1 70 

etgga lagta cegggactag qqcaaa : icag 2 1mm 

co 'age- i ' : > x agaaeaga tg jtcoccagaa 7»7-l0 

ccaqaaactq tcteaaggtt c.;c:aq a t qa 7 -iuO 

taa.;:ca.atea get^qettct cgcttetcqta 7-a.o 

.lag^jtaaga accocacacl eggegegeca 3 12 0 

ccgtgtar.ee aataaagect 1 1 tgctg tf.-"j 2A<-0 

gaggqtctcc tcctetgtcg c t eiq.acc t ge 7 7.10 

tagetgt t te etgtgtgaaa 1 1 gt tatccg 3f»00 

agcataaaqt qtaaageetq qgqtgcetaa 3 6r>0 

egeteaetge eegettteea gtegggaaac 77 20 

eaaegegegg ggagaggegg tttgegtatt 'MS0 

tegetgeget egg t cgt teg gctgeggega 38 4 0 

eggttatcea cagaatcagg gaataaegca 7 000 

aaggccagga jLCcqtaaaa-.-i ggeegegttg 3 0 60 

gacgagcatc acaaaaatc g aegetcaagt 4 020 

agataccagg cgttteeeee tggaagctce 4 080 

ettaceggat acctgtcegc ctttetccct 4 140 

cgetgtaggt atctcagtte ggtgtaggtc 4 2 00 

ccccccgtLc agcccgaccg ctgcgcctta 4260 

gtaagacacg acttatcgee actggeagca 4 320 

tatgtaggcg gtgetacaga gttcttgaag 4 380 

acagtatttg gtatctgege tetgetgaag 4440 

tcttgatccg gcaaacaaac caccgctggt 4500 

attacgegea gaaaaaaagg atctcaagaa 4S60 

gctcagtgga acgaaaactc acgttaaggg 4620 

ttcacctaga tccttttaaa ttaaaaatga 4 680 

taaacttggt ctgacagtta ecaatgetta 4/40 

etatttegtt catccatagt tgcctgactc 4 800 

ggcttaccat ctggccccag tgctgcaatg 4860 

gatttatcag caataaacca gccagccgga 4920 

ttatccgcct ccatccagte tattaattgt 4980 

gttaatagtt tgcgcaacgt tgttgccatt 5040 

tttggtatgg cttcattcag ctccggttcc 5 100 

atgttgtgca aaaaagcggt tagctccttc 5160 

gccgcagtgt tatcactcat ggttatggca 5220 

teegtaagat gcttttctgt gactggtgag 5280 
atgeggegae egagttgetc ttgcccggcg 5340 

agaactttaa aagtgctcat cattggaaaa 5400 

ttaccgctgt tgagatccag ttcgatgtaa 5460 
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cccacticaK] cacccaactq atcttcagca tct.t U acit tcaccaqcqt t tctgggtga 5520 

qoaaaaacaq (jaaqqcaaaa tgccqcaaaa aagggaataa gggcgaoacg aaaat gt tq<i 5 58 0 

at :t '-at ac: t.ct Lccttt t: t. caa t. alt a t tgaagcattt atcagggt ta I i;gtct;cat.g 5 64 0 

aa<g-Mt;aca tat.ttgaatg tat t. taqaaa aataaacaaa taggggt. tec qcqcacattt 5 700 

c • caaa,;ag tqccacctqa cgtctaagaa accatlatta t ca tgaoa 1 1: aa:cta t:aaa 5760 

a ! : .i : ii:al a tracgaggco etttegtc 5758 



< / I 0: 4 0 

< .11: 4 2 

<. .' i ^ Arta f icial Sequence 

< .V0> 

< : . - \:* rescript i on oi" Art i f i ci a 1 Sequence : Synthet i c DNA 

p> r i me r 

< -|:'0'- 40 

(Mt c^qqaat totceggaat tgqetagect agaqtcegtt acataact 



< :0> 4 1 

< .:il • 6 3 

< 212 - DNA 

< ■ Artificial Sequence 
- 220 - 

3 '■• Description of Artificial Sequence: Synthetic DNA 
primer 

4 00 - 4 1 

gaggactggc gcgccgagtg tggggttctt acccttttta tagacctccc accgtacacg 60 



^210- 4 2 

• 1 1 > 7 6 

• ;:i2 > dna 

2 13 • Artificial Sequence 

• 220- 

•■.223"- Description of Artificial Sequence: Synthetic DNA 
primer 

- '100 - 4 2 

aqatctccga ggcctgggac ccttcgaatt cgttaactga teaacgegtt ctagactaca 60 
tggcggccgcgtgttt ^ 6 



-210- 43 

2 1 1 - 4 4 

••212^ DNA 

■■ 213 Artificial 



Sequence 
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::2 0/-> 

'<r- Doscription of Artificial Sequence: Synthetic DNA 
prime r 

•KMr> 4 < 

q q q q q a t c c a q a < ; c : t c cj a q c 1 1 t*. q a a a a a c: a c q c < j q ccqc ca t q -1 -1 
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